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PREFACE 


Combinatorial reasoning underlies all analysis of computer systems. It 
plays a similar role in discrete operations research problems and in finite 
probability. Two of the most basic mathematical aspects of computer science 
concern the speed and logical structure of a computer program. Speed involves 
enumeration of the number of times each step in a program can be performed. 
Logical structure involves flow charts, a form of graphs. Analysis of the speed 
and logical structure of operations research algorithms to optimize efficient 
manufacturing or garbage collection entails similar combinatorial mathematics. 
Determining the probability that one of a certain subset of equally likely out- 
comes occurs requires counting the size of the subset. Such combinatorial 
probability is the basis of many nonparametric statistical tests. Thus, enumera- 
tion and graph theory are used pervasively throughout the mathematical sci- 
ences. 

This book teaches students in the mathematical sciences how to reason 
and model combinatorially. It seeks to develop proficiency in basic discrete 
math problem solving in the way that a calculus textbook develops proficiency 
in basic analysis problem solving. 

The three principal aspects of combinatorial reasoning emphasized in this 
book are: the systematic analysis of different possibilities, the exploration of 
the Jogical structure of a problem (e.g., finding manageable subpieces or first 
solving the problem with three objects instead of n), and ingenuity. Although 
important uses of combinatorics in computer science, operations research, and 
finite probability are mentioned, these applications are often used solely for 
motivation. Numerical examples involving the same concepts use more interest- 
ing settings such as poker probabilities or logical games. 

Theory is kept to a minimum. It is always first motivated by examples, 
and proofs are given only when their reasoning is needed to solve applied 
problems. Elsewhere, results are stated without proof, such as the form of 
solutions to various recurrence relations, and then applied in problem solving. 
Occasionally, a few theorems are stated simply to give students a flavor of 
what the theory in certain areas is like. 

The Mathematical Association of America recently formed a Panel on a 
General Mathematical Sciences Program to redefine the MAA’s model mathe- 
matical major to be a mathematical sciences major. This panel recommended 
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that all majors should study combinatorial problem solving. The discrete struc- 
tures course in the Association of Computing Machinery’s Curriculum 78 covers 
set theory, Boolean algebra, propositional logic, some modern algebra, and 
basics of counting and graph theory. This course is far too ambitious and has 
become a source of controversy. The limited amount of applied set theory, 
modern algebra, and logic in this book would be sufficient mathematical prepara- 
tion in these areas for most computer science students, when supplemented 
with weekly practice in induction proofs (see Appendix A.2). More importantly, 
basic skills in combinatorial reasoning and problem solving are, to this author, 
more fundamental mathematics for computer scientists. 

This book is designed for use by students with a wide range of ability and 
maturity (sophomores through beginning graduate students). The stronger the 
students, the harder the exercises that can be assigned. The book can be used 
for a one-quarter, two-quarter, or one-semester course depending on how much 
material is used. It may also be used for a one-quarter course in applied graph 
theory or a one-semester or one-quarter course in enumerative combinatorics 
(starting from Chapter 5). A typical one-semester undergraduate discrete meth- 
ods course should cover most of Chapters 1 to 3 and 5 to 8, with selected 
topics from other chapters if time permits. 

Instructors are strongly encouraged to obtain a copy of the instructor’ s 
guide accompanying this book. The guide has an extensive discussion of com- 
mon student misconceptions about particular topics, hints about successful 
teaching styles for this course, and sample course outlines (weekly assignments, 
tests, etc.). 

The third edition of this book, like the second edition, puts the graph theory 
chapters first, as recommended by the MAA Mathematical Sciences Panel. This 
edition has additional examples, expanded discussions, a glossary of graph 
theory terms, extended solutions, and additional exercises throughout the text. 

Many people gave useful comments about early drafts and the first edition 
of this text; Jim Frauenthal and Doug West were especially helpful. The idea 
for this book is traceable to a combinatorics course taught by George Dantzig 
and George Polya at Stanford in 1969, a course for which I was the grader. 
Ultimately, my interest in combinatorial mathematics and in its effective teach- 
ing rests squarely on the shoulders of my father, A. W. Tucker, who has long 
sought to give finite mathematics a greater role in mathematics as well as in 
the undergraduate mathematics curriculum. Finally, special thanks go to former 
students of my combinatorial mathematics courses at Stony Brook. It was they 
who taught me how to teach this subject. 


Alan Tucker 
Stony Brook, New York 
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CHAPTER 1 


ELEMENTS OF GRAPH 
THEORY 


1.1 GRAPH MODELS 


The first four chapters deal with graphs and their applications. A graph G = 
(V, E) consists of a finite set V of vertices and a set E of edges joining different 
pairs of distinct vertices. Figure 1.1a shows a depiction of a graph with V = 
{a, b, c,d} and E = {(a, b), (a, c), (a, d), (b, d), (c, d)}. We represent vertices 
with points and edges with lines joining the prescribed pairs of vertices. We 
do not allow two edges to join the same two vertices. Also an edge cannot 
“‘loop’’ so that both ends terminate at the same vertex. The two ends of an 
undirected edge can be written in either order, (b, c) or (c, b). We say that 
vertex a is adjacent to vertex b when there is an edge from a to b. 

Sometimes the edges are ordered pairs of vertices, called directed edges. 
In a directed graph, all edges are directed. See the directed graph in Figure 
1.1b. We write (b; c) to denote a directed edge from b to c. In a directed graph, 
we allow one edge in each direction between a pair of vertices. See edges (a; 
c) and (c; a) in Figure 1.15. 

Graphs have proven to be an extremely useful tool for analyzing situations 
involving a set of elements in which various pairs of elements are related by 
some property. The most obvious examples of graphs are sets with physical 
links, such as electrical networks, where electrical components (transistors) are 
the vertices and connecting wires are the edges; or telephone communication 
systems, where telephones and switching centers are the vertices and telephone 
lines are the edges. Road maps, oil pipelines, and subway systems are other 
examples. 

Another natural form of graphs is sets with logical or hierarchical sequenc- 
ing, such as computer flow charts, where the instructions are the vertices and 
the logical flow from one instruction to possible successor instruction(s) defines 
the edges; or an organizational chart, where the people are the vertices and if 
person A is the immediate superior of person B then there is an edge (A, B). 
Computer data structures, evolutionary trees in biology, and the scheduling of 
tasks in a complex project are other examples. 

Any of the preceding examples can generate enough gtaph-related questions 
for a whole book. Some of these questions can be settled quickly by inspection 
of the associated graph; some require general mathematical theory; and some 
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Figure 1.1 (a) (b) 


require very application-specific analysis where the graph models are only of 
limited conceptual help. In the chapters on graph theory, we will survey instances 
of all three types of questions. 

The emphasis will be on problem solving, with problems about general 
graphs and applied graph models. Our problem solving will rely on logical 
reasoning and mathematical insights. Observe that we will not have any numbers 
to work with, only some vertices and edges. At first, this may seem to be 
highly nonmathematical. It is certainly very different from the mathematics 
that students learn in high school or in calculus courses. However, disciplines 
such as computer science and operations research contain as much graph theory 
as they do standard numerical mathematics. 

The rest of this section consists of a collection of illustrative examples 
about graphs. We will solve each problem from scratch with a little logic and 
systematic analysis. Many of these examples will be revisited in greater depth 
in subsequent chapters. The following two terms will arise in some of the 
examples. A path is a sequence of distinct vertices (X|, X2, ..-, X,) such that 
consecutive vertices are adjacent, that is, there is an edge from each vertex x; 
to the next vertex x;,; in the path sequence. For example, in Figure 1.1a, (b, 
d, a, c) is a path. A circuit is a path that ends where it starts, that is, x, = % 
(technically, this is repeating a vertex, but this special repetition is allowed). 
In Figure 1.1a, (a, b, d) is a circuit. 


Example 1 


Suppose that we have five people A, B, C, D, E and five jobs a, b, c, d, e, and 
various people are qualified for various jobs. The problem is to find a feasible 
one-to-one matching of people to jobs, or to show that no such matching can 
exist. We can represent this situation by a graph with vertices for each person 
and for each job, with edges joining people with jobs for which they are 
qualified. Does there exist a feasible matching of people to jobs for the graph 
in Figure 1.2? : 

The answer is no. The reason can be found by considering people A, B, 
and D. These three people as a set are collectively qualified for only two jobs, 
c and d. Hence there is no feasible matching possible for these three people, 
much less all five people. An algorithm for finding a feasible matching, if any 
exists, will be presented in Chapter 4. Such matching graphs in which all the 
edges go horizontally between two sets of vertices are called bipartite. Bipartite 
graphs are discussed further in Section 1.3. = 
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Suppose the graph in Figure 1.3 represents a network of telephone lines (or 
electrical transmission lines). We are interested in the network’s vulnerability 
to accidental disruption. We want to identify those lines and switching centers 
that must stay in service to avoid disconnecting the network. 

There is no telephone line (edge) whose removal will disconnect the tele- 
phone network (graph). Similarly, there is no vertex whose removal disconnects 
the graph. 

Is there any pair of edges whose removal disconnects the graph? After 
inspection, we see that if the two edges incident to a are removed, vertex a is 
isolated from the rest of the network. A similar result occurs if the two edges 
incident to f are removed. Are there any other sets of two edges that disconnect 
the graph (the answer is left as an exercise)? 

Let us take a different tack. Suppose we want to find a minimal set of 
edges needed to link together the six vertices in Figure 1.3. There are several 
possible minimal connecting sets of edges. By inspection, we find the following 
one: (a, b), (b, c), (c, d), (d, e), (d, f). In Chapter 3 we discuss such minimal 
connecting sets of edges further. One interesting general result about these sets 
is that if the graph G has n vertices, then a minimal connecting set for G (if 
any exists) always has n — | edges. m= 


The minimal connecting sets mentioned in Example 2 are called trees. This 
important class of graphs is the subject of Chapter 3. The next example uses 
trees in a different way. 


Example 3 


Consider the problem of identifying an unknown control word X in a compiler. 
Suppose the possible control words are ADD, DO, FILL, GET, REPLACE, 
STORE, and WAIT. In a computer, words are actually numbers. We can test X 


Figure 1.3 c f 
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in a three-way branch (less than, or equal to, or greater than) against any control 
word. Devise an efficient scheme of three-way branch tests to identify X. (By 
efficient, we mean a scheme that minimizes the maximal number of tests that 
might be needed to identify X.) 

Figure 1.4 displays an efficient scheme in the form of a graph. A left 
downward edge from a vertex represents a “‘less than’’ answer to a three-way 
branch and a right downward edge from a vertex represents a ‘‘greater than’’ 
answer. We start by testing X against GET. Depending on the answer, we know 
either: (a) X = GET, or (b) X < GET and now we test X against DO, or (c) 
X > GET, and we test X against STORE. After at most one more test, this 
testing scheme will have identified X. 

The testing scheme of successively checking the unknown X against each 
of the seven possible words could require six tests, as opposed to two tests 
with this tree. When the set of possible matches becomes large, as in a spelling 
checking program (with 50,000 or more recognizable words), the efficiencies 
of tree-based searching are essential. = 


Example 4 


Consider the section of a city’s street map represented by the graph in Figure 
1.5. We want to position police at corners (vertices) so that they can keep every 
block (edge) under surveillance, that is, every edge should have police at (at 
least) one of its end vertices. What is the fewest number of police that can do 
this job? 

The map has 14 blocks (edges). Corners b, c, e, f, h, and j each are at the 
end of 3 edges, and corners a, d, g, i, and k each at the end of 2 edges. Since 
4 vertices can be incident to at most 4 X 3 = 12 edges but there are 14 edges 
in all, we will need at least five police. If all five police were positioned at 3- 
edge vertices, then 5 X 3 = 15 edges are watched by the 5 police. Since there 
are only 14 edges, some edge would be covered by police at both end vertices. 
If four police are at 3-edge vertices and one at a 2-edge vertex, then exactly 
14 edges are watched—and no edge need be covered at both ends. (If fewer 
than 4 of the 5 police are at 3-edge vertices, we could not watch all 14 edges). 
With these general observations, we are ready for a systematic analysis to try 
to find 5 vertices that are collectively adjacent to all 14 edges. 

Consider edge (c, d). Suppose it is watched by an officer at vertex d. Then 
vertex c (the other end vertex of edge (c, d)) cannot also have an officer, since 
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we noted above that if we use a 2-edge vertex, such as d, then no edge can be 
watched from both end vertices. However, if vertex c cannot be used, then 
edge (c, g) must be watched from its other end vertex g. But now we are using 
two 2-edge vertices, d and g. We noted above that at most one of the five 
police can be placed at a 2-edge vertex. We got into this trouble by assuming 
that edge (c, d) is watched from vertex d. 

Now assume no officer is at vertex d. Then we must watch edge (c, d) 
with an officer at vertex c. Since vertex d cannot be used, edge (d, h) can be 
watched only by placing an officer at vertex h. Next look at edge (h, &). It is 
already watched by vertex h. Then we assert that (h, &) cannot also be watched 
by an officer at vertex k, since k has just 2 edges and we noted above that if 
we use a 2-edge vertex, no edge can be watched from both ends. We conclude 
that there cannot be an officer at vertex k. Then edge (k, j) can be watched 
only by placing an officer at vertex j. We now have officers required to be at 
vertices c, h, and j. 

Similar reasoning shows that with an officer at vertex j, there cannot be 
an officer at vertex i; then there must be an officer at vertex e; there cannot 
be an officer at vertex a; and there must be an officer at vertex b. In sum, we 
have shown that we should place police at vertices c, h, j, e, and b. A check 
shows that these five vertices do indeed watch all 14 edges. Since our reasoning 
forced us to use these five vertices, no other set of five vertices can work. 

We conclude this example by noting that in this surveillance situation, one 
can also consider watching the vertices rather than the edges: How few officers 
are needed to watch vertices? An officer at vertex x is considered to be watching 
vertex x and all vertices adjacent to x. There are 11 vertices, and 6 of these 
vertices watch 4 vertices (themselves and 3 adjacent vertices). Thus three is 
the theoretical minimum. This minimum can be achieved. Details are left as 
an exercise. = 


A set C of vertices in a graph G with the property that every edge of G is 
incident to at least one vertex in C is called an edge cover. The previous 
example was asking for an edge cover of minimal size in Figure 1.5. The 
reasoning in Example 4 illustrates the kind of systematic case-by-case analysis 
that is common in graph theory. 

One goal of graph theory is to find relationships between seemingly unre- 
lated graph concepts that arise from different settings. The following example 
introduces a new concept and then shows an unexpected relation to edge covers. 
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Example 5 


Consider the following scheduling problem. A state legislature has many com- 
mittees that meet for one hour each week. One wants a schedule of committee 
meeting times that minimizes the total number of hours but such that two 
committees with overlapping membership do not meet at the same time. 

This situation can be modeled with a graph in which we create a vertex 
for each committee and join two vertices by an edge if they represent committees 
with overlapping membership. Suppose that the graph in Figure 1.6 represents 
the membership overlap of 10 legislative committees. For example, vertex c’s 
edges to vertices a, d, and f in Figure 1.6 indicate that committee c has overlap- 
ping members with committees a, d, and f. 

A set of committees can all meet at the same time if there are no edges 
between the corresponding set of vertices. A set of vertices without an edge 
between any two is called an independent set of vertices. Our scheduling 
problem can now be restated as seeking a minimum number of independent 
sets that collectively include all vertices. This problem is discussed in depth 
in Section 2.3. 

If we want to know how many committees can meet at one time, we are 
asking the graph question: What is the largest independent set of the graph? 
Although it is very hard in general to find the largest independent set in a 
graph, for the graph in Figure 1.6, a little examination shows that there are 
two independent sets of size 4, a, e, f, h and b, c, g, h. All other independent 
sets have three or fewer vertices. 

Now for the unexpected link with edge covers. If V is the set of vertices 
in a graph G, then J will be an independent set of vertices if and only if V ~ 
/ is an edge cover! Why? Because if there are no edges between two vertices 
in /, then every edge involves (at least) one vertex not in /, that is, a vertex in 
V — I. Conversely, if C is an edge cover so that all edges have at least one 
end vertex in C, then there is no edge joining two vertices in V — C. So V — 
C is an independent set. Check that in Figure 1.6, the vertices not in the 
independent set {a, e, f, h} form an edge cover. Note also that in the surveillance 
graph in Figure 1.5, the vertices not in our edge cover {b, c, e, h, J} form an 
independent set. 

A consequence of this relationship is that if J is an independent set of 
largest possible size in a graph, then V — J will be an edge cover of smallest 
possible size. So finding a maximal independent set is equivalent to finding a 
minimal edge cover. = 


Figure 1.6 c f i 
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Instead of modeling a geometric or other problem with a graph, it is possible 
to model a graph with a geometric problem. A graph G is called an interval 
graph if there is a one-to-one correspondence between the vertices of G and 
a collection of intervals on the line so that two vertices of G are adjacent when 
the corresponding intervals overlap. We now consider two problems in which 
we model a real-world situation with a graph and then seek to model the 
resulting graph with an overlapping family of intervals. In Section 2.3 an 
interval graph model arises in VLSI (Very Large Scale Integrated) circuitry 
design. 


Example 6 


A competition graph, used in ecology, has a vertex for each of a given set of 
species and an edge joining each pair of species that feed on a common prey 
(i.e., the two species compete for food). See the competition graph in Figure 
1.7a. Ecologists have given considerable attention to viewing competition 
graphs as interval graphs for the following reason (see reference [2] for further 
details). Each species is thought of having a ‘‘niche’’ in its ecological system, 
its position in terms of food it eats and land where it lives and similar attributes. 
A niche may be regarded as a point in some suitable multidimensional parameter 
space. 

We try to associate an interval with each species (vertex) such that compet- 
ing species (adjacent vertices) correspond to overlapping intervals. An interval 
is a geometric realization of a species’ niche and an interval graph model of 
a competition graph implies that the overall ecological system can be viewed 
in a simple, one-dimensional space. 

Do the ecological niches represented in the competition graph in Figure 
1.7a overlap in a linear fashion? That is, is the graph in Figure 1.7a an interval 
graph? 

The answer is no. A little thought shows that there is no way that four 
overlapping intervals can model the adjacencies among the four vertices a, g, 
e, f, which form a circuit in Figure 1.7a. Figure 1.7b shows that one of the 
intervals would have to wrap around to model the required adjacencies. = 


Example 7 


The following study was undertaken 30 years ago to determine the structure 
of genes. A gene is a piece of DNA molecule that instructs a cell how to build 
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Figure 1.7 
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(e) 
Figure 1.7 (continued) 


a particular protein. A protein molecule can have a variety of shapes: a linear ~ 
sequence or circle or pretzel-like arrangement of amino acids (the building 
blocks of proteins), and much more complicated forms. At the time, genes 
were known to be linked together in a long string of thousands of genes to 
form a DNA molecule. But initially researchers suspected that the structure of 
DNA in individual genes might mirror the complicated structure of the proteins 
they define. 

The following experiment was run by S. Benzer to examine the structure 
of one gene in one particular virus. He obtained dozens of mutations of the 
gene, each involving a different ‘‘blemish’’ in the gene (a ‘‘blemish’’ being a 
connected subsection of altered genetic information). Although Benzer did not 
know where these blemishes lay within the gene, he could distinguish one 
blemish from another (different blemishes affected the virus’s growth in differ- 
ent ways) and, through carefully designed experiments, could tell whether or 
not two blemishes overlapped. He wanted to see if the pattern of overlaps could 
be explained by assuming that the gene’s DNA had the structure of a long line 
with the blemishes being overlapping intervals on this line. 

Suppose that the graph in Figure 1.7c represents some such overlap informa- 
tion. Each vertex stands for a different mutation, arising from a specific blemish. 
An edge joins two vertices when the two associated blemishes overlap. Benzer 
was experimentally able to deduce the form of such an overlap graph. The 
question now is, can one create a set of intervals on a line, associating each 
interval with a vertex of this graph, so that two intervals overlap if and only 
if the corresponding vertices are adjacent? The overlaps among vertices a, b, 
d, f, which form a 4-vertex path, can be modeled only by a set of four intervals 
I,, 15, 1a, 1; arranged as shown in Figure 1.7d (or its mirror image). With a little 
trial and error, we build the collection of intervals shown in Figure 1.7e, whose 
overlaps reflect exactly the adjacencies of the graph in Figure 1.7c. 

In Benzer’s experiment, an interval model existed. The absence of an 
interval model would have been very strong evidence that the structure of a 
gene was not line-like but a more complicated structure, probably similar to 
the form of the protein defined by the gene. = 


We next give an example involving directed graphs. 
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Example 8 


Suppose psychological studies of a group of people determine which members 
of the group can influence the thinking of others in the group. We can make 
a graph with a vertex for each person and a directed edge (p;, p2) whenever 
person p, influences p,. Let the graph in Figure 1.8a represent a set of such 
influences. Now let us ask for a minimal subset of people who can spread an 
idea through to the whole group, either directly or by influencing someone who 
will influence someone else and so forth. In graph-theoretic terms, we want a 
minimal subset of vertices with directed paths to all other vertices (a directed 
path from p, to p;, is an edge sequence (p:; p2), (P2; Ps) - -- (Prt, Px). Such a 
subset of influential vertices is called a vertex basis. 

To aid us, we can build a directed-path graph for the original graph with 
the same vertex set and with a directed edge (p;, p;) if there is a directed path 
from p; to p; in the original graph. Figure 1.8b shows the directed-path graph 
for the graph in Figure 1.84. Now our original problem can be restated as: Find 
a minimal subset of vertices in the new graph with edges directed to all other 
vertices. This is just a directed-graph version of the vertex-covering problem 
mentioned at the end of Example 4. Observe that any vertex in Figure 1.8) 
with no incoming edges must be in this minimal subset (since no other vertices 
have edges to it); vertex b is such a vertex. Since b has edges to a, c, and d, 
then e, f, and g are all that remain to be ‘‘influenced.’’ Either e, f, or g 
‘tinfluence’’ these three vertices. Then b,; e, or b, f, or b, g are the desired 
minimal subsets of vertices. = 


We conclude this section with an example of an important graph-based 
technique for project analysis called PERT (Program Evaluation and Review 
Technique). In this graph, it is the edges, rather than the vertices, that represent 
activities. 


Example 9 


Consider a curtain-making company’s project of starting a new production and 
sales operation in a foreign country. The key tasks might be: A—market studies 


(a) (b) 
Figure 1.8 
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Start 


Figure 1.9 Finish 


to assess the demand for various types of curtains; B—hiring and training plant 
employees; C—renting or building the plant; D—hiring and training sales 
personnel; E—renting or building showrooms; and F—initial production (start- 
up) of curtains. We can model the project with a graph having an edge for 
each task and vertices to link edges according to task precedences. Suppose 
for this project that the appropriate task graph is the graph in Figure 1.9. 

To each task, one can assign a number indicating how many months are 
required to complete the task. Actually, the standard procedure is to use three 
numbers: a reasonable estimate, a best possible time, and a worst possible time. 
See Table 1.1. Using these numbers, one can calculate the maximum duration 
of the project under various assumptions about how many tasks require worst- 
case times, and so forth. One also tries to identify “‘bottleneck’’ tasks, for 
which the estimated or worst-case time adversely affects the project duration, 
that is, the initiation of many tasks depends on the completion of a ‘‘bottleneck’’ 
task (once identified, extra manpower can be allocated to ‘‘bottlenecks’’ to 
speed their completion). 

If only one task takes the worst-case time (and other tasks are completed 
in the estimated times), what is the longest time that might be required to finish 
the project—the length of the longest path from Start to Finish? 

By inspection, we see that the longest project duration occurs when task 
C takes its worst-case time of 12 months. Then the task sequence A~C—F will 
require 2 + 12 + 4 = 18 months. = 


Table 1.1 

Task 
Duration A B C D E F 
Estimate 2 5 4 3 4 4 
Best case 2 2 1 1 2 4 
Worst case 3 8 12 4 15 5 
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1.1 EXERCISES 


a 


ple 


ummary of Exercises The first seven exercises involve sim- 
graph models. Exercises 8-37 present examples and extensions of the 


models presented in the examples in this section. Exercises 38-40 involve 
other new graph models. 


1. 


Suppose interstate highways join the six towns A, B, C, D, E, F as follows: 
I-77 goes from B through A to E; I-82 goes from C through D then through 
B to F; I-85 goes from D through A to F; I-90 goes from C through E to 
F; and I-91 goes from D to E. 


(a) Draw a graph of the network with vertices for towns and edges for 
segments of interstates linking neighboring towns. 


(b) What is the minimum number of edges whose removal prevents travel 
between some pair of towns? 


(c) Is it possible to take a trip starting from town C that goes to every 
town without using any interstate highway for more than one edge (the 
trip need not return to C)? 


(a) Suppose four teams, the Aces, the Birds, the Cats, and the Dogs, play 
each other once. The Aces beat all three opponents except the Birds. 
The Birds lost to all opponents except the Aces. The Dogs beat the 
Cats. Represent the results of these games with a directed graph. 


(b) A ranking is a listing of teams such that the ith team on the list beat 
the (i + 1)st team. Find all rankings for part (a). 


. (a) A schedule is to be made with five football teams. Each team is to 


play two other teams. Explain how to make a graph model of this 
problem. 


(b) Show that except for interchanging names of teams, there is only one 
possible graph in part (a). 


. Suppose there are six people—John, Mary, Rose, Steve, Ted, and Wendy— 


who pass rumors among themselves. Each day John talks with Mary and 
Wendy; Mary talks with John, Rose, and Steve; Rose talks with Mary, 
Steve, and Ted; Steve talks with Mary, Rose, Ted, and Wendy; Ted talks 
with Rose, Steve, and Wendy; and Wendy talks with John, Steve, and Ted. 
Whatever people hear one day they pass on to others the next day. 


(a) Model this rumor-passing situation with a graph. 


(b) How many days does it take to pass a rumor from John to Steve? Who 
will tell it to Steve? 


(c) Is there any way that if two people stopped talking to each other, then 
it must take three days to pass a rumor from one person to all the 
others? 
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. (a) Give a direction to each edge in Figure 1.3 so that there are directed 


routes from any vertex to any other vertex. 

(b) Do part (a) so as to minimize the length of the longest directed path 
between any pair of vertices. Explain why a smaller minimum is not 
possible. 


. (a) In Figure 1.8b, what is the minimum number of edges whose directions 


must be reversed to yield directed routes from any vertex to any other 
vertex? 

(b) In Figure 1.8a, is it possible by reversing some edge directions to 
produce a graph with directed paths from any vertex to any other 
vertex? 


. The flow chart below displays the logical flow possibilities in a program. 


One possible execution sequence is ABDFGBCEFGH. 


(a) How many different execution sequences are possible? 
(b) How many execution sequences are there with 15 steps (vertices)? 


. In Example 2, find a minimal connecting set of edges containing neither 


(a, b) nor (b, d). 


. (a) Are there any other sets of 2 edges whose removal disconnects the 


graph in Figure 1.3 besides (a, b), (a, c) and (d, f), (e, f)? Hither 
produce others or give an argument why no others exist. 

(b) Find all sets of 2 vertices whose removal disconnects the remaining 
graph in Figure 1.3. 

Find all sets of 2 edges whose removal disconnects the graph in Figure 

1.5, 


(a) For the graph below, find all sets of 2 vertices whose removal discon- 
nects the graph of remaining vertices. 


(b) Find all sets of 2 edges whose removal would disconnect the graph. 
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(c) What is the minimal number of vertex deletions required to disconnect 
the graph formed by the 12 edges of a cube? 


12, Give another reason why Figure 1.2 has no matching by considering the 
appropriate subset of jobs (showing they cannot all be filled). 


13. Find a matching or explain why none exists for the following graphs: 


(a) A a (b) A a 
B b B b 
Cc c Cc c 
D d D d 


14, Find all matchings in Exercise 13(a). 


15. We generalize the idea of matching in Example 1 to arbitrary graphs by 
defining a matching to be a pairing off of adjacent vertices in a graph. For 
example, one possible matching in Figure 1.3 is a-b, c-d, e-f. Which of 
the following graphs have a matching? If none exists, explain why. 


(a) Figure l.la (b) Figure 1.5 
(c) : 
b d 
a ; e 
h f 


16. (a) Suppose a dictionary in a computer has a ‘‘start’’ from which one can 
branch to any of the 26 letters, and at any letter one can go to the 
preceding and succeeding letters. Model this data structure with a 
graph. 

(b) Suppose additionally that one can return to ‘‘start’’ from letters c or 
k or t. Now what is the longest path between any two letters? 


17. Build an efficient three-way branching scheme as in Example 3 to identify 
one of the 26 letters of the alphabet. 
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18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 
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(a) Repeat Example 3 with a two-way branch (less than; greater than or 
equal to). 

(b) In general, how many more rounds of testing will a two-way scheme 
for n words require over a three-way scheme (you do not need to 
justify your answer rigorously)? 


Suppose eight current varieties of chipmunk evolved from a common ances- 
tral strain through an evolutionary process in which at various stages one 
ancestral variety split into two varieties (none of the ancestral varieties 
survive when they split into two new varieties). 


(a) Explain how one might model this evolutionary process with a graph. 
(b) What is the total number of splits that must have occurred? 


In Figure 1.5, find all sets of three corners that have all 11 corners under 
surveillance. Give a careful logical analysis. 


Repeat Example 4 for minimal block and corner surveillance when the 
network in Figure 1.5 is altered by adding edges (/, g), (g, j) and deleting 
(b, f). 

Repeat Example 4 for the edge cover and minimal corner surveillance 
when the network is formed by a regular array of north-south and east— 
west streets of size: 


(a) 3 by 3 (b) 4 by 4 (ce) 5 by 5 


(a) A queen dominates any square on a chessboard in the same row, 
column, or diagonal as the queen. How few queens can dominate all 
squares on a 8 by 8 chessboard? 


(b) Repeat this problem for bishops, which dominate only diagonals. 


Solve the committee scheduling problem for the committee overlap graph 
in Figure 1.6. That is, what is the minimum number of independent sets 
needed to cover all vertices? 


(a) Find a maximum independent set in the following graphs: 
(i) Figure 1.la (ii) Figure 1.3 (iii) Figure 1.4 

(b) Use your result in part (a) to produce a minimal edge cover in these 
graphs. 


(a) Using a model similar to the schedule model in Example 5, describe 
how one might represent interlocking (overlapping) Boards of Directors 
of large corporations with a graph model, that is, a graph telling which 
Boards have overlapping members. 

(b) Suppose Figure 1.3 is the graph of such interlocking boards. What is 
the maximal number of boards that could meet at the same time? 


Which of the following graphs are interval graphs? If they are, give a 
model of overlapping intervals; if not, explain why not. 


28. 


29. 


30. 


31. 


32. 


33. 


34, 
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(a) Figure 1.3 (b) Figure 1.4 (c) Figure 1.5 
(d) ‘ 
a b d f & 


Find a graph G not containing a circuit of length four or more (without 
chords), such that G is not an interval graph but if any vertex is removed 
from G the resulting graph is always an interval graph. 


A unit interval graph is an interval graph in which all intervals have the 
same length. Find unit interval models for the following graphs. 


(a) Figure 1.la (b) Figure 1.3 
(c) Figure 1.7a with vertex e deleted 


Show that the graph below on the right is an interval graph but not a unit- 
interval graph. 


Show that adjacency in the graph above on the left can be modeled by 
overlapping rectangles in the plane. 


Six students A, B, C, D, E, F went to the library once last Monday night. 
One of them stole a book. Each student is questioned about whom they 
saw at the library, and all tell the truth except the thief (it is possible that 
one student saw another without in turn being observed, but we assume 
that one must have seen the other if both were there at the same time). A 
saw B and E; B saw A and F; C saw D and F; D saw A and F: E saw B 
and C; and F saw C and E. Who is the thief? (Hint: Use an interval graph.) 


Suppose a certain type of CB radio can be tuned to receive frequencies 
ranging from R to R’ but can transmit only from T to T’, where R < T 
<T' < R'. We make a “‘CB graph’”’ for a set of CB radios with an edge 
between two vertices if they represent CB radios that can mutually receive 
each other. 

(a) Show that any interval graph can be a CB graph. 


(b) Find a CB graph that is not an interval graph. 


Find a vertex basis in the following directed graphs: 
(a) Figure 1.15 (b) Figure 1.4 
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36. 
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(c) Figure 1.5 with edges directed by alphabetical order [e.g., edge (a, e) 
is directed from a to e] 


Show that the vertex basis in a directed graph is unique if there is no 
sequence of directed edges that forms a circuit in the graph. 


Find the longest time that might be required to complete the project in 
Example 9 under the following assumptions: 

(a) All tasks take worst-case times. 

(b) One task is best case and remaining tasks are estimated times. 

(c) At most one task is best case and at most one is estimated time. 


Suppose you have the extra resources in Example 9 to reduce the estimated 
time for tasks by a total of three months. How should you allocate this 
reduction among tasks so as to reduce as much as possible the completion 
time if all jobs take their estimated times? 


. A game for two players starts with an empty pile. Players take turns putting 


1 or 2 or 3 pennies in the pile. The winner is the player who brings the 
value of the pile up to 16¢. 


(a) Make a directed graph modeling this game. 


(b) Show that the second player has winning strategy by finding a set of 
four ‘‘good’’ pile values, including 16¢, such that the second player 
can always move to one of the ‘‘good’’ piles (when the second player 
moves to one of the good piles, the next move of the first player must 
be to a non-good pile, and from this position the second player has a 
move to a good pile, etc.). 


. The parsing of a sentence can be represented by a directed graph, with a 


vertex S (for the whole sentence) having edges to vertices Su (subject) and 
P (predicate), then Su and P having edges to the parts into which they are 
decomposed into pieces, and so on. 

Consider the abstract grammar with decomposition rules: § — AB, S 
— BA, A — ABA, B — BAS, and B — S. For example, BAABA can be 
“‘parsed’’ as shown below. 


a 
/ IN 
7 ‘ 


40. 


41. 
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Find a parsing graph for each of the following (or explain why no 
parsing exists): 
(a) BABABABA (b) BBABAABA 
A signed directed graph has a plus (+) or a minus (—) associated with 


each edge (i; /) indicating whether person i has a positive or a negative 
influence on person j. If i expresses an opinion in favor of an idea and 


a + b 
+ + 
d c 


there is an edge (i; /) with a ‘‘+,”’ then a unit of time later j will express 
an opinion in favor of the idea. If there are also edges (j; &) with ‘‘+”’ 
and (j; m) with ‘‘—,’’ then two units after i first spoke, k will speak in 
favor and m against the idea. If a person is influenced by several people 
who just expressed an opinion in the previous unit of time, the person 
expresses a positive, negative, or no opinion, depending on whether the 
algebraic sum of influences is positive, negative, or zero, respectively. If 
person a expresses a positive opinion at time 0 in the above graph, who 
will express opinions (of what type) at time 3? 


Ask a friend to pose three subject areas in which you must find problems 
having graph models. 


1.22 ISOMORPHISM 


In this section we investigate some of the basic structure of graphs. We are 
interested in properties that distinguish one vertex in a graph from another 
vertex and, more generally, that distinguish one graph from another graph. We 
motivate this discussion with the question: How can we tell if two graphs are 
really the same graph, but drawn differently and with different names for the 
vertices? For example, are the two 5-vertex graphs in Figure 1.10 different 
versions of the same graph? 


A graph can be drawn on a sheet of paper in many different ways. Thus, 


it is usually possible to draw a graph in two ways that would lead a casual 
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viewer to consider the drawings to be ‘‘different’’ graphs. This motivates the 
following definition. 


Two graphs G and G’ are called isomorphic if there exists a one-to- 
one correspondence between the vertices in G and the vertices in G’ 


such that a pair of vertices are adjacent in G if and only if the correspond- 
ing pair of vertices are adjacent in G’. 


Such a one-to-one correspondence of vertices that preserves adjacency is 
called an isomorphism. A useful way to think of isomorphic graphs is as 
follows: The first graph can be redrawn on a transparency that can be exactly 
superimposed over a drawing of the second graph. 

To be isomorphic, two graphs must have the same number of vertices and 
the same number of edges. The two graphs in Figure 1.10 pass this initial test. 
Both graphs have one vertex, e and 5, respectively, at the end of just one edge. 
Then any isomorphism of these two graphs must match e with 5. Also, the 
vertices at the other ends of the edge from e and 5 must be matched; that is, 
d matched with 4. (Think of superimposing one graph over the other.) The 
remaining three vertices in each graph are mutually adjacent (forming a triangle) 
and also are all adjacent to d or 4, respectively. Thus the matching a with /, 
b with 2, and c with 3 (or any other matching of these two subsets of three 
vertices) will preserve the required adjacencies. The correspondence a— 1, b—2, 
c—3, d—4, e—5 is then an isomorphism, and the two graphs are isomorphic. 
To visualize how they can be made to look the same, think of moving vertices 
4 and 5 in the right graph upward and to the right [past edge (1,3)], so that 1, 
2, 3, 4 form a quadrilateral with crossing diagonals. 

The degree deg(x) of a vertex is the number of edges incident to the vertex. 
Degrees are preserved under isomorphism, that is, two matched vertices must 
have the same degree. Then in Figure 1.10, e has to be matched with 5 and d 
matched with 4 because they are the unique vertices of degree 1 and 4 in their 
respective graphs. Further, two isomorphic graphs must have the same number 
of vertices of a given degree. For example, if they are to be isomorphic, the 
two graphs in Figure 1.10 must both have the same number of vertices of 
degree 3— they do; both have 3 vertices of degree 3. 


Ja 


Figure 1.10 » c 2 3 
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A subgraph is a graph formed by a subset of vertices and edges of a larger 
graph. If two graphs are isomorphic, then corresponding subgraphs must be 
isomorphic. In Figure 1.10, removal of vertices e and 5 (and their incident 
edges) leaves two isomorphic subgraphs consisting of four mutually adjacent 
vertices. Once this subgraph isomorphism is noted, isomorphism of the whole 
graphs is easily demonstrated. 

Subgraphs can be used to test for isomorphism in the following way. If a 
graph G has a set of 6 vertices forming a chordless circuit of length 6 (chordless 
means there are no other edges between these 6 vertices except the 6 edges 
forming the circuit), then any graph isomorphic to G must also have a set of 
6 vertices generating such a chordless 6-circuit. 

A graph in which each vertex is adjacent to all the other vertices is called 
a complete graph on n vertices, denoted K,. A complete graph on 2 vertices, 
K,, is just an edge. Complete subgraphs are in a sense the building blocks of 
all larger graphs. For example, both graphs in Figure 1.10 consist of a K, and 
a K, joined at a common vertex. Conversely, every graph on n vertices is a 
subgraph of K,,. 

Before examining other pairs of graphs for isomorphism, let us mention 
the practical importance of determining whether two graphs are isomorphic. 
Researchers working with organic compounds build up large dictionaries of 
compounds that they have previously analyzed. When a new compound is 
found, they want to know if it is already in the dictionary. Large dictionaries 
can have many compounds with the same molecular formula but differing in 
their structute as graphs (and possibly in other ways). Then one must test the 
new compound to see if its graph-theoretic structure is the same as the structure 
of one of the known compounds with the same formula (and the same in other 
ways), that is, whether the new compound is graph-theoretically isomorphic to 
one of a set of known compounds. A similar problem arises in designing 
efficient integrated circuitry for an electrical network. If the design problem 
has already been solved for an isomorphic network (or if a piece of the new 
network is isomorphic to a previously designed network), then valuable savings 
in time and money are possible. 


Example 1 


Are the two graphs in Figure 1.11 isomorphic? 

Both graphs have 8 vertices and 10 edges. Let us examine the degrees of 
the different vertices. We see that b, d, f, h and 3, 4, 7, 8 have degree 2, while 
the other vertices have degree 3. Then the two graphs have the same number 
of vertices of degree 2 and the same number of degree 3. The respective 
subgraphs of the 4 vertices of degree 2 (and the edges between these degree- 
2 vertices) in each graph must be isomorphic if the whole graphs are isomorphic. 
However, there are no edges between any pair of b, d, f,; h, while the other 
subgraph of degree-2 vertices has two edges: (4, 3) and (8, 7). So the subgraphs 
of degree-2 vertices are not isomorphic, and hence the two full graphs are not 


22 Chapter 1 Elements of Graph Theory 


Figure 1.11 » 


isomorphic. The reader can also check that the two subgraphs of degree-3 
vertices in each graph are not isomorphic. = 


The vertices of degree 2 in the left graph in Figure 1.11 form a subgraph 
of mutually nonadjacent vertices. Such a subgraph is called a set of isolated 
vertices. 


Example 2 


Are the two graphs in Figure 1.12 isomorphic? 

The two graphs both have 7 vertices and 14 edges. Every vertex in both 
graphs has degree 4. Further, both graphs exhibit all the symmetries of a regular 
7-gon. With no distinctions possible even among vertices within the same 
graph, our only option is to try to construct an isomorphism. Start with vertex 
a in the left graph. By symmetry, we can match a to any vertex in the right 
graph (that is, if the two graphs are isomorphic, there will exist an isomorphism 
with a matched to any vertex in the right graph). Let us use the match a—J. 

The set of neighbors of a (vertices adjacent to a) must be matched with 
the set of neighbors of /. Let us look at the subgraphs formed by these neighbors 
of a and J. See Figure 1.13. Both subgraphs are paths: One is f to g tod toc, 
and the other is 7 to 4 to 5 to 2. The matching must make these path subgraphs 
isomorphic. Thus, f and c must be matched with 7 and 2 (matching ends of 
the two paths). By the horizontal symmetry of the graphs, it makes no difference 
which way f and c are matched—say f—7 and c—2. Then to complete the 


oy 
mo 
oa 
w 


Figure 1.12 e d 5 4 
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Figure 1.13 5 4 


isomorphism of neighbors of a and /, we must match g with 4 and b with 5. 
Now there remain only two unmatched vertices in each graph: d, e and 3, 6. 
Vertex g is adjacent to e but not d, and its matched vertex 4 is adjacent to 3 
but not to 6. Thus we must match e with 3 and d with 6. 

In sum, allowing for symmetries to match a with J and f with 7, we 
conclude that if the graphs are isomorphic, one isomorphism must be a—/, 
b—5, c—2, d—6, e-3, f—7, g—4. Checking edges, we see that the graphs are 
indeed isomorphic with this matching (if this matching were found not to be 
an isomorphism, then the two graphs would not be isomorphic since the matches 
we made were all forced except for the symmetry involving the matches of a 
and f). = 


Given a graph G = (V, E), its complement is a graph G = (V, £) with 
the same set of vertices but now with edges between exactly those pairs of 
vertices not linked in G. Two graphs G, and G, will be isomorphic if and only 
if G, and G, are isomorphic. The isomorphism problem in Example 2 is easy 
to answer using complements. Figure 1.14 shows the complements of the two 
graphs in Figure 1.12. Clearly, both these complementary graphs are just a 
(twisted) circuit of length 7 and hence are isomorphic. 

In general, if a graph has more pairs of vertices joined by edges than pairs 
not joined by edges, then its complement will have fewer edges and thus will 
probably be simpler to analyze. 


Example 3 


Are the two directed graphs in Figure 1.15 isomorphic? 
Each graph has 8 vertices and 12 edges, and each vertex has degree 3. If 
we break the degree of a vertex into two parts, the in-degree (number of edges 


Figure 1.14 e d 5 4 
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Figure 1.15 


pointed in toward the vertex) and out-degree (number of edges pointed out), 
we see that each graph has 4 vertices of in-degree 2 and out-degree 1, and 
each graph has 4 vertices of in-degree 1 and out-degree 2. We could try to 
build an isomorphism as in the previous example by starting with a match (by 
a symmetry argument) between a and | and then matching their neighbors 
(with edge directions also matched), and so forth. 

However, there is a basic difference in the directed path structure of the 
two graphs. We will exploit this difference to prove nonisomorphism. In the 
left graph we can draw a directed path from any given vertex to any other 
vertex by going clockwise around the circle of vertices: The outer edges form 
a directed circuit through all the vertices in the left graph. But in the right 
graph, all edges between the vertex subsets V, = {1, 2,3, 4} and V, = {5, 6,7, 
8} are directed from V, to V2, and thus there can be no directed paths from 
any vertex in V, to any vertex in V, (nor is there a directed circuit through all 
the vertices). Thus, the two graphs are not isomorphic. = 


1.2 EXERCISES 


1. List all nonisomorphic undirected graphs with four vertices. 
2. List all nonisomorphic directed graphs with three vertices. 
3. Explain why the subgraphs of vertices of degree 3 in Figure 1.11 are not 
isomorphic. 
4. If directions are ignored, are the two graphs in Figure 1.15 isomorphic? 
5. Which of the following pairs of graphs are isomorphic? Explain carefully. 
3 


(a) ¢ 
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s of graphs are isomorphic? Explain carefully. 


(b) 


6. Which of the following pair 


(a) . ; 
AB BEA 
. q * 5 f € 6 4 
(c) ; (d) 
OO 6 e 
(e) (f) 
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7. Which pairs of graphs in this set are isomorphic? 
14 ge 
7 8 19 
F : Sar 13 15 
5 2 24 
4 3 VON 18 16 
12 11 23 20 
17 
25 ss a5 
ee - \2/ 37 
. : / 
28 36 apy eal = AO 
8. Which pairs of graphs in this set are isomorphic? 
18 
2 9 10 11 12 45 19 «28 28 33 35 
= se 
3 16 15 14 13. 23 21 32 29 39 37 
22 
9. Suppose each edge in the graphs in Figure 1.12 is directed from the smaller 


(numerically or alphabetically) end vertex to the larger end vertex. Are the 
two resulting directed graphs isomorphic? 


10. Are the following pairs of directed graphs isomorphic? 
e b 1 2 3 
a b 1 2 
UN © x 
: 5 4 : Z 6 5 4 


11. Show that all 5-vertex ‘graphs with 2 edges incident to each vertex are 
isomorphic. 


12. Are there any 6-vertex graphs with 3 edges incident to each vertex that 
are not isomorphic to one of the graphs in Exercise 7? 
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13. What is the minimum number of vertices that must be deleted from the 
two graphs in the following figures to get isomorphic subgraphs? 


(a) Figure 1.11 (b) Figure 1.15 (c) Exercise 5(d) 

14. What are the sizes of the largest complete subgraphs in the two graphs in 
Exercise 6(h)? 

15. What are the sizes of the largest sets of isolated vertices in the two graphs 
in Exercise 6(g)? 

16. Build 6-vertex graphs with the following degrees of vertices, if possible. 
If not possible, explain why not. 
(a) Three vertices of degree 3 and 3 vertices of degree 1. 
(b) Vertices of degrees 1, 2, 2, 3, 4, 5. 


1.3. EDGE COUNTING 


There is very little in the way of general assertions that can be made about all 
graphs. There is one useful general theorem, a formula for counting edges. 


Theorem 1 


In any graph, the sum of the degrees of all vertices is equal to twice the number 
of edges. 


Proof 


Summing the degrees of all vertices counts all instances of some edge being 
incident at some vertex. But each edge is incident with two vertices, and so 
the total number of such edge—vertex incidences is simply twice the number 
of edges. The theorem is now proved. # 


As an illustration of Theorem 1, consider the graph in Figure 1.16 with 6 
vertices, three of degree 4, two of degree 3, and one of degree 2. The sum of 
the degreesis4+4+4+3+3+ 2 = 20. This sum must equal twice the 
number of edges. The reader can check that the number of edges in this graph 
is 10. 

For a set of integers to sum to an even integer, there must be an even 
number of odd integers in the sum. Thus we obtain: 


Corollary 
In any graph, the number of vertices of odd degree is even. 


Let us now look at uses of this theorem and corollary. 
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Figure 1.16 f e 


Example 1 


Suppose we want to construct a graph with 20 edges and have all vertices of 
degree 4. How many vertices must the graph have? 

Let v denote the number of vertices. The sum of the degrees of the vertices 
will be 4v, and by the theorem this sum must be twice the number of edges: 
4v = 2 X 20 = 40. Hence v = 10. » 


Example 2 


How many edges are there in K,, a complete graph on n vertices? 

Recall that K, has an edge between all possible pairs of vertices. At any 
given vertex, there will be edges going to each of the n—1 other vertices in 
K,,, and so each vertex has degree n—1. The sum of the degrees of all vertices 
in K, will be n(n — 1)—n vertices of degree n — 1. Since this sum equals 
twice the number of edges, the number of edges is n(n — 1)/2. m= 


Example 3 


Is it possible to have a group of seven people such that each person knows 
exactly three other people in the group? 

If we try to model this problem using a graph with a vertex for each person 
and an edge between each pair of people who know each other, then we would 
have a graph with 7 vertices all of degree 3. But this is impossible by the 
corollary—-the number of vertices of odd degree must be even—and so no 
such set of seven people can exist. = 


We next present a puzzle that seems to have no relation to graphs. 


Example 4 


Two people start at locations A and Z at the same elevation on opposite sides 
of a mountain range whose summit is labeled M. See Figure 1.17a. We pose 
the following puzzle: Is it possible for the people to move along the range in 
Figure 1.17a to meet at M in a fashion so that they are always at the same 
altitude every moment? We shall show this is possible for any mountain range 
like Figure 1.17a. The one assumption we make is that there is no point lower 
than A (or Z) and no point higher than M. 
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Figure 1.17 


We make a range graph whose vertices are pairs of points (P,, Pr) at the 
same altitude with P, on the left side of the summit and P, on the right side, 
such that one of the two points is a local peak or valley (the other point might 
also be a peak or valley). The vertices for the range in Figure 1.17a are shown 
in the graph in Figure 1.17b. We make an edge joining vertices (P,, Pr) and 
(P1, Pp) if the two people can move constantly in the same direction (both 
going up or both going down) from point P, to point P; and from Pr to Pr, 
respectively. Our question is now, Is there a path in the range graph from the 
starting vertex (A, Z) to the summit vertex (M, M)? For the graph in Figure 
1.17b, the answer is obviously yes. 

We claim that vertices (A, Z) and (M, M) in any range graph have degree 
1, whereas every other vertex in the range graph has degree 2 or 4. (A, Z) has 
degree 1 because when both people start climbing up the range from their 
respective sides, they have no choice initially but to climb upwards until one 
arrives at a peak. In Figure 1.17a, the first peak encountered is C on the left, 
and so the one edge from (A, Z) goes to (C, X). A similar argument applies 
at (M, M). Next consider a vertex (P,, Pg) where one point is a peak and the 
other point is neither peak nor valley, such as (E, W). From the peak we can 
go down in either direction: at W, we can go down toward Z or toward U. In 
either direction, the people go until one (or both) reaches a valley. At (E, W), 
the two edges go to (D, Y) and (D, U). So such a vertex has degree 2.A 
similar argument applies if one (but not both) point is a valley. It is left as an 
exercise for the reader to show that if a vertex (P:, Pz) consists of two peaks 
or two valleys, such as (D, U), it will have degree 4. (A vertex consisting of 
a valley and a peak will have degree 0—why’?) 

Suppose there were no path from (A, Z) to (M, M) in the range graph. We 
use the fact that starting vertex (A, Z) and summit vertex (M, M) are the only 
vertices of odd degree. The part of the range graph consisting of (A, Z) and 
all the vertices that can be reached from (A, Z) would form a graph with just 
one vertex of odd degree, namely, (A, Z). This contradicts the corollary and 
so any range graph must have a path from (A, Z) to (M, M). = 


Many interesting properties in graph theory are dependent on certain sets 
of edges having even size. Euler cycles, discussed in Section 2.1, arise when 
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A a 
B b 
Cc (a 
D d 
Figure 1.18 & e 


all vertices have even degree. In Example 2 of Section 1.1, we considered a 
matching problem involving the graph shown in Figure 1.18. The vertices on 
the left represented people and the vertices on the right represented jobs. An 
edge links a left vertex to a right vertex to indicate that a certain person can 
perform a certain job. There can never be an edge between two vertices on the 
left or between two vertices on the right. Such a graph is called a bipartite 
graph. 

Bipartite graphs can be characterized by the fact that all circuits in such 
graphs have even length (if there are no circuits, the graph is also bipartite), 
where the length of a circuit or path is the number of edges in it. 


Theorem 2 
A graph G is bipartite if and only if any circuit in G has even length. 


Proof 


First we show that if G is bipartite, then any circuit has an even length. If G 
is bipartite so that it can be drawn with all edges connecting a left vertex with 
a right vertex, then any circuit (%), x), X3,.. . , Xn» 1) has alternately a left vertex, 
then a right vertex, then a left vertex, etc., assuming the first vertex x, is on 
the left. Odd subscripted vertices are on the left and even subscripted vertices 
on the right. See Figure 1.19a. Since x, is adjacent to x,, x, must be on the 
right and so its subscript is even. That is, there are an even number of vertices 
in the circuit. Any circuit has the same number of edges as vertices, and thus 
this circuit has even length. 

Suppose next that any circuit in G (there may be no circuits) has even 
length. We show how to construct a bipartite arrangement of G. Take any given 
vertex, call it a, and put it on the left. Put all vertices adjacent to a on the right. 
Next put all vertices that are two edges away from a, that is, at the end of 
some path of length 2 from a, on the left. In general, if there is a path of odd 
length between a and a vertex x, put x on the right. If there is a path of even 
length between a and x, put x on the left. 

There cannot be distinct paths P and P’ between a and x of odd and of 
even lengths, respectively, since taking P from a to x and then returning to a 
on P’ yields an odd-length circuit. This is impossible, since all circuits have 
even length. (If P and P’ have a vertex g in common besides a and x, then a 
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further argument is needed to show that there is a circuit of odd length. See 
Exercise 13 for details.) 

Similarly, we argue that there cannot be an edge between two vertices, 
say, b and c, both on the left. There must exist even-length paths Q, Q’ joining 
a with b and c, respectively (since b and c are on the left). See Figure 1.195, 
in which Q is dashed and Q’ is dotted. Going from a to b on Q, then using 
edge (b, c) followed by Q' back to a yields an odd-length circuit—impossible. 
By similar reasoning, two vertices on the right cannot be adjacent. Thus, we 
have a bipartite arrangement of G. 


Example 5 


Is the graph in Figure 1.20a bipartite? 

Pick any vertex, say a, and put it on the left. We follow the approach in 
the second half of the proof of Theorem 2. Put vertices joined to a by an even- 
length path on the left and vertices joined to a by an odd-length path on the 
right. If all the circuits in this graph are even-length, then the reasoning in the 
above proof guarantees that our placement of vertices will yield a bipartite 
arrangement. If we end up with two vertices on the left (or on the right) being 
adjacent, then the graph cannot be bipartite. In this case, the construction 
succeeds, as shown in Figure 1.20b. = 


S 


Qo 


Figure 1.20 (a) (b) 
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1.3 > EXERCISES 


1. How many vertices will the following graphs have if they contain: 
(a) 12 edges and all vertices of degree 2. 
(b) 15 edges, 3 vertices of degree 4, and the other vertices of degree 3. 
(c) 20 edges and all vertices of the same degree. 


2. For each of the following questions, describe a graph model and then 
answer the question. 


(a) Must the number of people at a party who do not know an odd number 
of other people be even? 


(b) Must the number of people ever born who had (have) an odd number 
of brothers and sisters be even? 


(c) Must the number of families in Alaska with an odd number of children 
be even? 


(d) For each vertex x in the graph below, let 5(x) denote the number of 
vertices adjacent to at least one of x’s neighbors. Must the number of 
vertices with s(x) odd be even? Is this true in general? 


b 


3. What is the largest possible number of vertices in a graph with 19 edges 
and all vertices of degree at least 3? 


4. If a graph G has n vertices, all of which but one have odd degree, how 
many vertices of odd degree are there in G, the complement of G? 


5. Suppose all vertices of a graph G have degree p, where p is an odd number. 
Show that the number of edges in G is a multiple of p. 


6. There used to be 26 football teams in the National Football League (NFL) 
with 13 teams in each of two conferences (each conference was divided 
into divisions, but that is irrelevant here). An NFL guideline said that each 
team’s 14-game schedule should include exactly 11 games against teams 
in its own conference and 3 games against teams in the other conference. 
By considering the right part of a graph model of this scheduling problem, 
show that this guideline could not be satisfied! 

7. Prove a directed version of Theorem 1: The sum of the in-degrees of 
vertices in a directed graph equals the sum of the out-degrees of vertices, 
and further each sum equals the number of edges. 
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8. 


10. 


12. 


13. 


Build the range graph for each of the following mountain ranges and use 
the graph to find a solution to the problem in Example 4. 


(b) 


. Prove in a range graph that if a vertex (P1, Pz) consists of two peaks or 


two valleys, it will have degree 4. 


Prove in a range graph that if a vertex (P,, Pr) consists of a valley and a 
peak, it will have degree 0. 


. Determine whether the following graphs are bipartite. If so, give the parti- 


tion into left and right vertices as in Figure 1.20. 
(a) Figure 1.5 (b) Figure 1.7a (c) Figure 1.11 (left graph) 


Determine whether the following graphs are bipartite. If so, give the parti- 
tion into left and right vertices as in Figure 1.20b. 


(b) 


In the second part of the proof of Theorem 2, one can encounter the situation 
where there exist paths P and P’ between a and x, P of odd length and P’ 
of even length, and these two paths have one or more vertices in common. 
One must show that a subset of the edges on these two paths forms an 
odd-length circuit. Let q be the first vertex on P, starting from a, that also 
lies on P’. Show that either the circuit from a along P to g and then back 
on P’ to a, or the edge sequence from q along P to x and then back on P’ 
to g, has odd length. In the latter case, if the edge sequence is not a circuit, 
then it has a vertex g’ on both P and P’. Repeat the same reasoning 
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considering the circuit on P from q to q’ and then back on P’ to q or the 
edge sequence from q’ along P to x and back along P’ to q’. 


—=—«er Oh 


1.4 PLANAR GRAPHS 


The most natural examples of graphs are street maps and telephone networks. 
The graphs that arise from such physical networks usually have the property 
that they can be depicted on a piece of paper without edges crossing (different 
edges meet only at vertices). We say that a graph is planar if it can be drawn 
on a plane without edges crossing. We use the term plane graph to refer to a 
planar depiction of a planar graph. The two graphs in Figure 1.21 are both 
planar. The graph in Figure 1.21b is a plane graph. The graph in Figure 1.2la 
is planar since it can be redrawn in the form of the graph in Figure 1.21b. 

Our principle focus in this section is determining whether a graph is planar. 
We take two approaches. The first approach involves a systematic method for 
trying to draw a graph edge-by-edge with no crossing edges, in the same spirit 
of the way in Section 1.2 we tried to determine if two graphs are isomorphic, 
The second approach develops some theory with a goal of finding useful 
properties of planar graphs. These properties will yield simple tests for showing 
that a graph is not planar. 

Remember that if a graph G has been drawn with edges crossing, this does 
not mean the graph is nonplanar. There may be another way to draw the graph 
without edges crossing, as illustrated by the graph in Figure 1.21a, which can 
be redrawn to be the plane graph in Figure 1.21b. 

Probably the most important need today for testing whether a graph is 
planar arises in designing electronic circuits. Complex integrated circuits are 
nonplanar and require several layers of (planar) circuit connections in their 
wiring. But the number of layers is limited and so a major problem in integrated 
circuit design is decomposing a large circuit into a minimal number of parts 
that are known to be planar. 
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Figure 1.21 
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A more mundane, but still important, use of planarity testing arises in 
checking data-entry errors in planar networks. When a large planar graph such 
as a city’s street network is entered on data terminals for computerized analysis 
(say, for use by a computer program to determine routes for street sweepers), 
it is a common error-checking technique to test first whether the graph as 
typed in is indeed planar (most data-entry errors would make the graph non- 
planar). 

Planar graphs were first studied extensively by mathematicians over 100 
years ago in connection with a map-coloring problem. 


Example 1 


One of the oldest problems in graph theory concerns map coloring. The question 
is how many colors are needed to color countries on some map so that any 
pair of countries with a common border are given different colors. A map of 
countries is a planar graph with edges as borders and vertices where borders 
meet. See Figure 1.22a. However, a closely related planar graph called a dual 
graph of the map graph is more useful. The dual graph is obtained by making 
a vertex for each country and an edge between vertices corresponding to two 
countries with a common border. See Figure 1.22b. (Normally a vertex is also 
included for the unbounded region surrounding the map.) 

The question now is how many colors are needed to ‘‘color’’ the vertices 
such that adjacent vertices have different colors. In Figure 1.22b, vertices A, 
B, C, D form a complete subgraph on four vertices and so each requires a 
different color, four colors in all. With four colors, we can also properly color 
the remaining vertices. 


A E 
D 

F 

; G 


Figure 1.22 (6) 
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One of the most famous unsolved problems in all of mathematics during 
the last century was the conjecture that ail planar graphs can be properly colored 
with only four colors. In trying to resolve this conjecture, mathematicians 
developed a large theory about planar graphs. In 1976 the four-color conjecture 
was proven true by Appel and Haken using an immense computer-generated, 
case-by-case, exhaustive analysis (there were 1955 classes of graphical configu- 
rations to be considered, each involving numerous subcases). We will take a 
closer look at graph coloring in the next chapter. = 


We shall now to try to find a systematic way to draw a graph in the plane 
without edges crossing. As with isomorphism between two graphs, we want 
to be able to conclude that a graph is not planar if our construction fails. We 
shall call our approach the circle-chord method. It starts by finding a circuit 
that contains all the vertices of our graph (though such circuits do not exist for 
all graphs, they are common in the types of graphs we will be considering in 
this section). We draw this circuit as a large circle. The remaining noncircuit 
edges, which we will call chords, must be drawn either inside the circle or 
outside the circle. 

We choose one chord and draw it, say, outside the circle. If properly chosen, 
this chord will force certain other chords to be drawn inside the circle (if also 
placed outside the circle, they would have to cross the first chord). These inside 
chords will force still other chords to be drawn outside, and so on. After the 
first chord is drawn, the choice of placing subsequent chords inside or outside 
is forced. Thus, if we reach a point where a new chord will have to cross some 
previous chord, whether the new chord is drawn inside or outside, we can claim 
that the graph must be nonplanar. If all the chords can be added without crossing 
other chords, then the graph is planar. 

A critical decision is whether the first chord drawn should go inside or 
outside the circle. We claim that it makes no difference, because of the following 
inside—outside symmetry of a circle. Consider two maps of the earth, the first 
with the North Pole at the center of the map, the second with the South Pole 
at the center. Suppose each map has a path drawn in the Northern Hemisphere 
linking two cities on the equator. In the first map (with the Northern Hemisphere 
inside the equator) the path is inside the equator’s circle, whereas in the second 
map the path is outside the equator. Think of the circle formed by the circuit 
in the circle—chord method as the equator and the first chord as the path between 
two cities. Whether the chord is inside or outside the circle is just a matter of 
which ‘‘map’’ of the earth one uses. 

We note that very efficient algorithms exist to test whether a graph is 
planar, whereas there is no efficient algorithm known to test whether two graphs 
are isomorphic. The planarity testing algorithms are fairly complicated and 
beyond the scope of this text. 
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(a) 
Use the circle—chord method to determine whether the graph in Figure 1.23a 
is planar. Let us look for a circuit with all 8 vertices. One possibility is a-f-c- 
h-d-g-b-e-a. Now try to add the other four edges (a, h), (b, f), (c, 8), (d, e). 
By inside—outside symmetry, we can start by drawing (a, h) outside. See Figure 


1.23b. Then (b, f) and (c, g) must go inside. Then (d, e) must go outside. So 
the graph is planar. = 


EGE 
Sa c 
b g 
SS 
Figure 1.23 (6) 


Example 2 


Example 3 


Show that K33, the graph in Figure 1.24a, is nonplanar. The notation K3, 
indicates that this graph is a complete bipartite graph consisting of two sets of 
3 vertices with each vertex in one set adjacent to all vertices in the other set. 
Applying the circle—chord method, we form a circuit containing all six vertices 
in K, and then try to add the remaining edges (not in the circuit) as inside 
and outside chords. 

There are several choices for a 6-vertex circuit. Suppose we use the circuit 
(1, 4, 2, 5, 3, 6, 1) and draw it in a circle as shown in Figure 1.24b. Next the 
edges (1, 5), (2, 6), and (3, 4) must be added. First draw chord (7, 5). By the 
inside—outside symmetry of a circle discussed above, we can assume that (/, 
5) is drawn inside the circuit, as in Figure 1.24b. Then (2, 6) must be drawn 
outside the circuit to avoid crossing chord (/, 5). Finally, we must draw (3, 
4): If drawn outside the circuit, (3, 4) would have to cross chord (2, 6); if 
drawn inside the circuit, (3, 4) would have to cross chord (J, 5). Thus K3; 
cannot be drawn in a planar depiction. Hence K33 is nonplanar. = 


Figure 1.24 (a) (d) 
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Using a mixture of theory and careful, case-by-case analysis, it is possible 
to prove that any nonplanar graph always contains a K;; or a K; (shown in 
Figure 1.25a) as a subgraph or a slight modification of these two graphs. It is 
left as an exercise to show that the circle—chord method (used in Example 3) 
can be used to show that K; is nonplanar. Thus, these two graphs are the 
“‘reason’’ that a graph cannot be drawn in a planar fashion. 

We have to allow a slight variation in K;; and K; in nonplanarity analysis. 
Figure 1.25b shows a K;3 graph that has been subdivided by adding vertices 
in the middle of some of its edges. The resulting graph is no longer a K33 and 
does not contain K33 as a subgraph, yet it is still nonplanar (repeatedly adding 
a vertex in the middle of an edge cannot make a nonplanar graph planar). We 
say that a subgraph is a K3; configuration if it can be obtained from a K;; by 
adding vertices in the middle of some edges. A Ks configuration is de- 
fined similarly. The following nonplanar graph characterization theorem was first 
proved by the Polish mathematician Kuratowski in 1930. 


Theorem 1 


A graph is planar if and only if it does not contain a subgraph that is a Ks or 
K;3 configuration. 


If the circle—chord method shows that a graph is nonplanar, then by Theorem 
1 this graph has a subgraph that is a Ks or K33 configuration. Finding such a 
configuration can sometimes be tricky. However, the following observation is 
helpful: Most nonplanar graphs contain a K,3 configuration, K;configurations 
are less common. All nonplanar graphs in the exercises have K33 configurations. 
Note also that the depiction of a K;; in Figure 1.24b as a 6-vertex circle with 
3 chords joining pairs of opposite vertices is the way that a K;3; configuration 
normally arises in a nonplanar graph. 


Example 4 


Use the circle—chord method to determine whether the graph in Figure 1.26a 
is planar. If it is nonplanar, find a subgraph that is a K33 configuration. 

First we seek a circuit that visits all vertices. Many such circuits exist. 
Choose the circuit a-b-c-d-e-f-g-h-a, shown in Figure 1.26b. Say we pick (a, 


Figure 1.25 (a) (b) 
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(a) 


(c) 


Figure 1.26 


d)as our first chord to draw. By inside—outside symmetry, it makes no difference 
whether we draw (a, d) inside or outside the circuit. Put it inside. Then (b, e) 
must go outside. Next look for another chord reaching across the circle. Observe 
that chord (c, g) cannot be drawn either inside or outside. So the graph is 
nonplanar. 

Next we look for a K33 configuration. We simplify matters by restricting 
our attention to the subgraph in Figure 1.26b, whose crossing edges proved 
that the full graph had to be nonplanar. A K33 configuration has 6 vertices of 
degree 3 (corresponding to the vertices of a K;;) plus some number of vertices 
of degree 2 that subdivide the edges of a K,3. The easiest way to find a K3,3 
configuration in a given graph is to eliminate edge subdivisions in the graph 
(remove each vertex of degree 2 and combine the 2 edges at each such vertex 
into a single edge). Figure 1.26c shows the subgraph in Figure 1.26b with 
subdivisions removed. The graph in Figure 1.26c looks just like the depiction 
of a K33 in Figure 1.24b. Thus, the subgraph in Figure 1.26b was a K33 configura- 
tion. # 


Finding a K;3 configuration in the graph in Figure 1.26a (without using 
the subgraph in Figure 1.26b) would be very challenging. The problem in 
finding a K;3 configuration in a general nonplanar graph is the following. Let 
z be some vertex of degree 3 in the original graph and suppose that just two 
of the z’s edges, say (z, r) and (z, q), are part of a K33 configuration. Then z 
corresponds to a subdivision vertex in this K;3; configuration, and these two 
edges of z need to be fused into a single edge (r, q) to find the underlying K33. 
That is, z disappears and a new edge (r, q) is created. Using the subgraph 
produced by the circle—chord method makes it much easier to identify vertices 
of degree 2 in a K33 configuration whose two edges should be fused together. 

There are many different plane graph depictions that can be drawn for a 
planar graph. For example, we can redraw the plane graph in Figure 1.27a by 
making the region bounded by the triangle (d, e, f) very large and bringing 
vertex a to the right side, as in Figure 1.275. Now flip the part of the graph 
above and to the right of the triangle inside the triangle, obtaining the plane 
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(a) (b) (c) 
Figure 1,27 


graph in Figure 1.27c. The triangle (d, e, f) has become the outside boundary 
of the whole graph. The boundary of any region can be converted to the outside 
boundary of the whole graph by a similar process. 

Despite this variability in plane graph depictions of a planar graph, one 
important property of the plane depictions does not change. The number of 
regions is always the same. For simplicity, assume that G is a connected planar 
graph (connected means ‘‘in one piece,’’ i.e., with paths between every pair 
of vertices). If v and e denote the number of vertices and edges, respectively, 
in G, then a plane graph depiction of G will always have a number of regions 
r given by the formula, r = e — v + 2. Note that the unbounded region outside 
the graph is counted as a region. This remarkable formula for r was discovered 
by Euler in 1752. 


Theorem 2. Euler’s Formula 


If G is a connected planar graph, then any plane graph depiction of G has 
r=e-—v _+ 2 regions. 


Proof 


Let us draw a plane graph depiction of G edge by edge. Let G, denote the 
(connected) plane graph obtained after n edges have been added, and let v,, 
e,, and r, denote the number of vertices, edges, and regions in G,,, respectively. 
Initially we have G,, which consists of one edge, its two end vertices, and the 
one (unbounded) region. Then e,; = 1, v, = 2, r,; = 1, and so Euler’s formula 
is valid for G,, since r; = e, — v; + 2:1 = 1 — 2 + 2. We obtain G, from 
G, by adding an edge at one of the vertices in G,. In general, G,, is obtained 
from G,-, by adding an nth edge at one of the vertices of G,_;. The new edge 
might link two vertices already in G,-_,. If it does not, the other end vertex of 
the nth edge is a new vertex that must be added to G,,. 

We will now use the method of induction (see Appendix A.2) to complete 
the proof. We have shown that the theorem is true for G;. Next we assume 
that it is true for G,-, for any n > 1, and prove it is true for G,. Let (x, y) be 
the nth edge that is added to G,_, to get G,. There are two cases to consider. 


42 Chapter 1 Elements of Graph Theory 


Figure 1.28 (a) 


In the first case, x and y are both in G,-,. Then they are on the boundary 
of a common region K of G,_;, possibly the unbounded region [if x and y were 
not on a common region, edge (x, y) could not be drawn in a planar fashion, 
as required]. See Figure 1.28a. Edge (x, y) splits K into two regions. Then 
r, =Vn-1 + 1,€, = G1 + 1, V, = Vp-1. So each side of Euler’s formula grows 
by 1. Hence if the formula was true for G,-,, it will also be true for G,. 

In the second case, one of the vertices x, y is not in G,-), say it is x. See 
Figure 1.28b. Then adding (x, y) implies that x is also added, but no new 
regions are formed (i.e., no existing regions are split). Thus r, = [,-1, €, = 
e,-, + 1, Vv, = Vn-1 + 1, and the value on each side of Euler’s formula is 
unchanged. The validity of Euler’s formula for G,,_; implies its validity for G,. 

By induction, the formula is true for all G,’s and hence true for the full 
graph G. @ 


Example 5 


How many regions would there be in a plane graph with 10 vertices each of 
degree 3? 

By the theorem in Section 1.3, the sum of the degrees, 10 X 3, equals 2e, 
and so e = 15. By Euler’s formula, the number of regions r is 


r=e-v+2=15-10+2=758 


Theorem 2 has the following corollary that can often be used to show 
quickly that a graph is nonplanar. 


Corollary 
If G is a connected planar graph with e > 1, then e = 3v — 6. 


Proof 


Let us define the degree of a region to be the number of edges on its boundary. 
To be precise, we say that if an edge occurs twice along a boundary, as does 
(x, y) in region K in Figure 1.29a, the edge is counted twice in region K’s 
degree; for example, region K has degree 10 and region L has degree 3 in 
Figure 1.29a. Observe that each region in a plane graph must have degree = 
3, for a region of degree 2 would be bounded by 2 edges joining the same pair 
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Figure 1.29 (a) 


of vertices and a region of degree 1 would be bounded by a loop edge (see 
Figure 1.29b), but parallel edges and loops are not allowed in graphs. 

Since each region has degree = 3, the sum of the degrees of all regions 
will be at least 3r. But this sum of degrees of all regions must equal 2e, since 
this sum counts each edge twice, that is, each of an edge’s two sides is part 
of some boundary (this is the same type of argument as used to show that the 
sum of the vertices’ degrees equals 2e). Thus, 2e = (sum of regions’ degrees) 
= 3r, or fe = r. Combining this inequality with Euler’s formula (Theorem 2), 


we have 
fe =>r=e-—v+2 or O=te-vi2 


Solving for e, we obtain e = 3v — 6. @ 


Example 6 


Use the corollary to prove that Ks, the complete graph on 5 vertices, is nonplanar. 

The graph K; has v = 5 and e = 10 (see Example 2 in Section 1.3 for 
how to find the number of edges in a complete graph). Then 3v — 6 = 3 X 
5 — 6 = 9. But the corollary says that e <= 3v — 6 must be true in a connected 
planar graph, and so K; cannot be planar. = 


The corollary should not be misinterpreted to mean that if e = 3v — 6, 
then a connected graph is planar. Many nonplanar graphs also satisfy this 
inequality. For example, K33 with v = 6 and e = 9 satisfies it. 

Our two theorems and the corollary have laid the foundation for a mathemat- 
ical theory of planar graphs. In the process, we have acquired a practical aid 
for showing that a graph is nonplanar. One way to extend this theory is to 
make the inequality ‘‘stronger,’’ that is, get a smaller upper bound on e. Recall 
that the key step in proving the corollary was the observation that every region 
has degree at least 3. This led to the inequality 2e = 3r. Suppose that a 
certain connected graph G (with at least 2 edges) is known to be bipartite. By 
Theorem 2 in Section 1.3, all the circuits in a bipartite graph have even 
length. Then no region in this graph can have degree 3 (since this would imply 
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a boundary circuit of length 3). Then every region in a bipartite planar graph 
must have degree = 4. Summing the degrees of all regions, we now obtain 
the inequality 2e = (sum of degrees of regions) = 4r. Reworking the corollary 
with this inequality 2e = 4r, we have 7#e = r = e — v + 2 and 


hence 
ex2v-4 


Every connected planar graph that is bipartite must satisfy this inequality. 
Consider our ‘‘favorite’’ bipartite graph K33. K;3 has v = 6 and e = 9 and, as 
noted above, satisfies the corollary inequality e = 3v — 6. But since it is 
bipartite, K;; would also have to satisfy the new inequality e = 2v — 4 if it 
were planar. It does not: 9 # 2 X 6 — 4. 


1.4 EXERCISES 


Summary of Exercises The first six exercises involve 
determining whether various graphs are planar and drawing planar graphs in 
different ways. Exercise 8 involves duality. Exercises 12-24 build on Euler’s 
formula and the corollary e < 3v—6. The other exercises introduce new con- 
cepts, 


1. Draw a dual graph of the planar graph in (include a vertex for the unbounded 
region outside the graph): 
(a) Figure 1.21b (b) Figure 1.225 

2. Show that K; is nonplanar by the method in Example 2. 


3. Which of the following graphs are planar? Find K;; configurations in the 
nonplanar graphs. 


(a) (b) (c) 


4 
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. Make a graph with one vertex for each square on a 4 X 4 chessboard and 


an edge between two vertices that correspond to squares that can be the 
start and end of a knight’s move (go two squares in one direction and then 
one square to either side). Is this graph planar? 


. (a) For what values of n is K, planar? 


(b) For what values of r and s is the complete bipartite graph K,., planar? 
(K,,; is a bipartite graph with r vertices on the left side and s vertices 
on the right side and edges between all pairs of left and right vertices). 


. Re-draw the graph in Figure 1.23b so that the infinite region is bounded 


by the circuit a-f-c-h-a. 


. In each case, give the values of r, e, or v (whichever is not given) assuming 


that the graph is planar. Then either draw a connected, planar graph with 
the property, if possible, or explain why no such planar graph can exist. 


(a) 7 vertices and 13 edges (e) 5 regions and 10 edges 

(b) 6 vertices and 8 regions (f) 6 vertices all of degree 4 

(c) 13 edges and 9 regions (g) 7 vertices all of degree 3 

(d) 6 vertices and 14 edges (h) 5 regions all with 4 boundary edges 


. The construction of a dual D(G) can be applied to any plane graph G: 


Draw a vertex of D(G) in the middle of each region of G and draw an 
edge e* of D(G) perpendicular to each edge e of G; e* connects the vertices 
of D(G) representing the regions on either side of e. 

(a) A dual need not be a graph. It might have two edges between the same 
pair of vertices or a self-loop edge (from a vertex to itself). Find two 
planar graphs with duals that are not graphs because they contain these 
two forbidden situations. 


(b) Show that the duals of the two different plane depictions of the graph 
in Figures 1.27a and 1.27c are isomorphic. 

(c) Show that the degree of a vertex in the dual graph D(G) equals the 
number of boundary edges of the corresponding region in the planar 
graph G. 

(d) Find a planar graph that is isomorphic to its own dual. 

(e) Show for any plane depiction of a graph G that the vertices of G 
correspond to regions in D(G). 


- (a) Show that if two regions R,, R, in a planar graph that have a common 


edge both have an even number of boundary edges, then the circuit 
of edges that encloses the two regions has even length. 
(b) Show that if a circuit in a planar graph encloses a collection of regions, 


each of which has an even number of boundary edges, then the circuit 
has even length. 


. The crossing number c(G) of a graph G is the minimum number of pairs 


of crossing edges in a depiction of G. For example, if G is planar then 
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c(G) = 0. Determine c(G) for the following graphs: 
(a) K33 (b) K; (c) Figure 1.26a. 
(d) An 8-gon with diagonals joining opposite pairs of corners. 


. A graph G is critical nonplanar if G is nonplanar but any subgraph obtained 


by removing a vertex is planar. 
(a) Which of the following graphs are critical planar? 
(i) K33 (ii) K; (iii) Figure 1.26a. 


(b) Show that critical planar graphs must be connected and cannot have 
a vertex whose removal disconnects the graph. 


. If a connected planar graph with n vertices all of degree 4 has 10 regions, 


determine n. 


. Suppose a planar graph is not connected but instead consists of several 


components, that is, disjoint connected subgraphs. 


(a) Find the appropriate modification of Euler’s formula for a planar graph 
with ¢ components. 


(b) Show that the corollary is valid for unconnected planar graphs. 


. Prove that if_a graph G has at least 11 vertices, then either G or its 


complement G must be nonplanar. 


Mimic the argument in the corollary to prove that e = 3r — 6 in a planar 
graph with each vertex of degree =3. 


. (a) Prove that every connected planar graph has a vertex of degree at most 


5. Hint: Assume that every vertex has degree at least 6 and obtain a 
contradiction. 


(b) Show that part (a) immediately generalizes to any (unconnected) planar 
graph. 


. Prove that every connected planar graph with less than 12 vertices has a 


vertex of degree at most 4. [Hint: Assume that every vertex has degree at 
least 5 to obtain a lower bound on e (together with the upper bound on e 
in the corollary) that implies v = 12.] 


. If G is a connected planar graph with all circuits of length at least k, show 


that the inequality e < 3v — 6 can be strengthened to e = “Ss (v—2) 


. (a) Show that every circuit in the graph in Exercise 3(a) has at least 5 


edges. 


(b) Use part (a) and the result of Exercise 18 to show that this graph is 
nonplanar. 


(a) Give an example of a graph with regions consisting solely of squares 
(regions bounded by 4 edges) and hexagons, and with vertices of degree 
at least 3. 


25. 
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(b) Use Exercise 15 to show that any graph of the sort defined in part (a) 
has at least 6 squares. 


(c) If each vertex has degree 3, show that any graph of the sort defined 
in part (a) has exactly 6 squares. 


. A graph G is called maximal planar if G is planar but the addition of 


another edge will make the graph nonplanar. 

(a) Show that every region of a connected, maximal planar graph will be 
triangular. 

(b) If aconnected, maximal planar graph has n vertices, how many regions 
and edges does it have? 


. A Platonic graph is a planar graph in which all vertices have the same 


degree d, and all regions have the same number of bounding edges d,, 

where d, = 3 and d, = 3. A Platonic graph is the ‘‘skeleton’’ of a Platonic 

solid, for example, an octahedron. 

(a) If G is a Platonic graph with vertex and face degrees d, and dy, 
respectively, then show that e = $d,v and r = (d,/d,)v. 

(b) Using part (a) and Euler’s formula, show that v(2d, + 2d, — d,d,) = 
4d). 

(c) Since v and 4d, are positive integers, we conclude from part (b) that 
2d, + 2d, — d,d, > 0. Use this inequality to prove that (d; — 2)(d, — 
2) < 4. 

(d) From part (c), find the five possible pairs of positive (integral) values 
of di, d. 


. Suppose / lines are drawn through a circle and these lines form p points 


of intersection (involving exactly two lines at each intersection). How many 
regions r are formed inside the circle by these lines? Assume that the lines 
end at the edge of the circle at 2/ distinct points. 


. Consider an overlapping set of 4 circles A, B, C, D. One would like to 


position the circles so that every possible subset of the circles forms a 
region, ¢.g., 4 regions each contained in just one (different) circle, 6 regions 
formed by the intersection of 2 circles (AB, AC, AD, BC, BD, CD), 4 
regions formed by the intersection of 3 of the 4 circles, and 1 region formed 
by the intersection of all 4 circles. Prove that it is not possible to have 
such a set of 15 bounded regions. 


A line graph L(G) of a graph G has a vertex of L(G) for each edge of G 
and an edge of L(G) joining each pair of vertices corresponding to two 
edges in G with a common end vertex. 


(a) Show that L(K;) is nonplanar. 
(b) Find a planar graph whose line graph is nonplanar. 
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26. Show that the following graphs can be drawn on the surface of a doughnut 
(torus) without crossing edges: 


(a) K33 (b) Ks (c) K, (d) K; (tricky) 


(1.5 SUMMARY AND REFERENCES 


This chapter introduced graphs, their applications, and some of their basic 
structures. This text takes a user-oriented approach to graph theory. Readers 
interested in a more formal graph theory text presenting the subject as an 
interesting area of pure mathematics should see the books by Bondy and Murty 
{1], Harary [4], or Wilson [5]. Section 1.1 introduced a set of illustrative graph 
models. The basic structure of graphs was explored in Section 1.2 under the 
guise of determining what makes two graphs different. Section 1.3 presented 
some useful edge-counting results. The final section introduced the important 
class of planar graphs. It surveyed ad hoc and theoretical approaches for 
determining whether a graph is planar. 

The history of graph theory begins with the work of L. Euler in 1736 on 
Euler circuits (discussed in Section 2.1). Euler’s formula for planar graphs, 
originally stated in terms of polyhedra, was proved in 1752. Bits of graph 
theory appeared in papers about topology and geometric games, but it was not 
until around 1850 that formal studies of graphs began to appear. One was A. 
Cayley’s 1857 paper counting the number of trees (discussed in Chapter 3). 
Another was G. Kirchhoff ’s 1847 paper presenting an algebra of circuits and 
introducing graphs in the study of electrical circuits. This same paper contains 
Kirchhoff ’s famous current and voltage laws (Kirchhoff was 21 when he wrote 
this historic paper). The term graph was first used by J. Sylvester in 1877. The 
first book on graph theory, by D. Konig, did not appear until 1936. An excellent 
sourcebook on the history of graph theory is Graph Theory 1736-1936 by 
Biggs, Lloyd, and Wilson [3]. 

See the General References (at the end of the book) for a list of other 
introductory texts on graph theory. 
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SUPPLEMENT I: 
REPRESENTING GRAPHS INSIDE A COMPUTER 


This supplement is meant as a brief overview. For a fuller discussion of data struc- 
tures, the reader should consult a data structures text, such as A. Aho, J. Hopcroft, 
J. Ullman, Data Structures and Algorithms, Addison-Wesley, Reading, MA, 1983, or 
R. Sedgewick, Algorithms in C, Addison-Wesley, Reading, MA, 1990. 


Graph problems that involve large graphs or that must be solved repeatedly 
need to be programmed for computer solution. Unlike numerical computations, 
such as inverting a matrix or numerically approximating the solution of a 
differential equation, most graph problems require no arithmetic operations. 
Instead, the essence of these problems is systematic search and the use of the 
right data structures for organizing the information about vertices and edges 
of a graph inside a computer. In this supplement, we present the most common 
ways of representing graphs in a computer. 

The simplest but least efficient way to represent a graph is with an adjacency 
matrix. This matrix has a row and a column for each vertex. Entry (i, j) = 1 
if vertex x; is adjacent to vertex x; and entry (i, /) = 0 otherwise. Figure 1.30b 
shows the adjacency matrix for the graph in Figure 1.30a. In a directed graph, 
entry (i, j) = 1 if and only if there is edge (x;; x;). Because the adjacency 
matrix often has Os in most entries, it is an inefficient way to represent a graph. 

A better approach is to store only those adjacencies that do occur. A list 
of all the edges is compact but it requires extensive searching to determine the 
set of vertices adjacent to a given vertex. A compromise between an adjacency 
matrix and an edge list is a set of adjacency lists—for each vertex, we have a 


a bc def 

a 0 1 1 0 0 0 

g 7 i bod. OE A 2 0 

e 1 1 0 1 21 90 

d@d 0 1 1 0 1 21 

a c e e 0 O 1 1 0 1 
f 9 0 0 1 1 +0 


Figure 1.30 (a) (b) 
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list of the vertices adjacent to it. Figure 1.31 contains the adjacency lists for 
the graph in Figure 1.30a. The first element in each list is the length of the 
list. 

It should be noted that these adjacency lists can often be quite wasteful of 
space if one does not know in advance how long an adjacency list might be. 
In some graph problems the number of adjacencies will vary over time. Then 
each list may initially have to be given a length equal to the maximum possible 
number of adjacencies at a vertex (for an n-vertex graph, the maximum possible 
number of adjacencies is n — 1). 

The following more sophisticated data structure, called a linked list, effi- 
ciently handles this difficulty. A linked list is a set of arrays. Figure 1.32 has 
a linked list for the graph in Figure 1.30a. The primary array contains the 
adjacency lists of each vertex, but these lists are not stored in order. The 
secondary, or pointer, array contains ‘‘pointers’’ telling where the next entry 
in an adjacency list appears in the primary array. For example, if entry 17 
in the main array contains the name of one of the vertices adjacent to c, 
then entry 17 in the pointer array has the entry number in the primary array 
of the next vertex in c’s adjacency list. In Figure 1.32, entry 17 in the pointer 
array is 21, telling us that the next vertex in c’s adjacency list is in entry 21 
of the primary array. Entry 21 in the primary array is the last vertex in c’s 
adjacency list, and this is indicated by having entry 21 in the pointer array 
blank. 

There are two other arrays in this linked list. We need an array, called the 
Index Array, telling us where the first entry in each adjacency list appears (in 
the primary array). We also maintain an array, called the Unused Entry Array, 
with the numbers of entries not currently being used. The reason for this array 
will be explained shortly. 

Vertex c’s adjacency list is stored in entries 3 > 4 — 17 — 21. Suppose 
we want to delete the adjacency in entry 17 of the primary array. Then we 
change entry 4 in the pointer array from 17 to 21, and now c’s adjacency list 
is stored in entries 3 > 4 — 21. We also place the number 17 in the Unused 
Entries array. 

On the other hand, if we wanted to add a new entry in c’s original adjacency 
list following entry 4, then we get the location of an unused entry from the 
Unused Entry Array, say entry 23, and change entry 4 in the pointer array from 
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4 b 17 Index Array 
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10 f _ e 13 
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12 d 8 

a y ae Unused Entry 
15 _ _ Array 
16 d — 5 
17 d 21 15 
18 e 14 20 
19 c 18 9 
20 = = 23 
21 e — 
22 d 10 


Figure 1.32 23 


17 to 23. We store the name of the new vertex adjacent to c in entry 23 of the 
primary array and set entry 23 in the pointer array to 17. Now c’s adjacency 
list is stored in entries 3 > 4 > 23 — 17 > 21. 


SUPPLEMENT II: 
SUPPLEMENTARY EXERCISES 


Summary of Exercises Graph theory is a field famous for its 
interesting problems. Several exercises introduce new graph concepts, such as 
strong connectedness (Exercise 12) and cut-set (Exercise 18). Exercise 22 is a 
very famous problem in Ramsey Theory (see Appendix A.4). For more problems 
in graph theory, see any of the graph theory texts listed in the References at 
the end of this text. 


1. Suppose that there are seven committees with each pair of committees 
having a common member and each person being on two committees. How 
many people are there? 
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. Show that the complement of K,, a complete graph on 7 vertices, is a set 


of n isolated vertices. 


. A graph is regular if all vertices have the same degree. If a graph with n 


vertices is regular of degree 3 and has 18 edges, determine n. 


4. If a graph has 50 edges, what is the least number of vertices it can have? 


. If every vertex in a directed graph G has positive out-degree (at least one 


outwardly directed edge), then: 
(a) Must G contain a directed circuit? 
(b) Must every vertex of G be on a directed circuit? 


. Show that at least two vertices have the same degree in any graph with at 


least two vertices. (Hint: Be careful about vertices of degree 0.) 


. Show that an undirected graph with all vertices of degree =2 must contain 


a circuit (edges cannot be repeated in a circuit). 


. If every vertex in a graph G has degree = d, then show that G must contain 


a circuit of length at least d + 1. 


Show that if a graph is not connected (connected means each pair of vertices 
is joined by a path), then its complement must be connected. 


If there is no path between every pair of vertices in an (undirected) graph, 
then the graph partitions into a set of components, with two vertices joined 
by a path if and only if the vertices are in the same component. Let G be 
an undirected graph G with at least two components. Show that if G contains 
exactly two vertices of odd degree, then they must be in the same component 
of G. 


. Show that removal of some vertex x disconnects the connected undirected 


graph G if and only if there are two vertices a and b in G such that all 
paths in G from a to b pass through x. 


. A directed graph G is called strongly connected if there is a directed path 


from x to y for any two vertices x, y in G. Direct the edges in the following 
graphs to make the graphs strongly connected. If not possible, explain why. 
(An application of this problem is making streets in a city one-way.). 


(a) Figure 1.5 (b) Figure 1.6 (c) Figure 1.10 (left graph) 


. Prove that G is strongly connected (see Exercise 12) if and only if G’s 


vertices cannot be partitioned into two sets Vi, V, such that there are no 
edges from a vertex in V, to a vertex in V>. 


. A bridge is an edge whose removal disconnects G. Prove that the edges 


of a connected undirected graph G can be directed to create a strongly 
connected graph (see Exercise 12) if and only if there is no bridge in G. 


. Draw planar graphs with the following types of vertices, if possible: 


(a) Six vertices of degree 3. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
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(b) i vertices of degree i, i = 1, 2, 3, 4, 5. 
(c) Two vertices of degree 3 and four vertices of degree 5. 


. Suppose that there are three farms each with a child, a goat, and a rabbit. 


Arbitrarily name the three sets of inhabitants C,, C,, C., Ga, Gp, G., and 

R,, R,, R.. Use the following information and a graph model to determine 

the three groups of a child, goat, and rabbit on each farm: 

(a) The male child on the farm with goat G, and the male child on the 
farm with rabbit R, are competing for the attention of the female child 
C, on the third farm. 


(b) Goat G, and rabbit R, are not on the same farm. 

(c) The boy on the farm with rabbit R, is not C,. 

A trail is a sequence of consecutively linked edges in which no edge can 
appear more than once. Unlike a path, a vertex can be visited any number 


of times in a trail. A cycle is a trail that starts and ends at the same vertex. 
A cycle that repeats no vertices is a circuit. 


(a) Show that a subset of the vertices on a trail from x to y can be used 
to make a path from x to y. 

(b) Prove or give a counterexample: If x and y lie on a cycle, then they 
must lie on a circuit. 

(c) Show that the edges in a cycle can be partitioned into a collection of 
circuits. 


(d) Show that if C is an odd-length cycle, then a subset of C’s edges forms 
an odd-length circuit. 


A cut-set S is a set of edges in a connected undirected graph G whose 

removal disconnects G, but no proper subset of § can disconnect G. 

(a) Find a cut-set of minimal size in Figure 1.12. 

(b) Show that every cut-set has an even number of edges in common with 
any circuit (remember that 0 is an even number). 

A graph with n vertices and n + 2 edges must contain two edge-disjoint 

circuits. Prove or give a counterexample. 


Show that if an n-vertex graph has more than 3(n—1)(n —2) edges, then 
it must be connected. (Hint: The most edges possible in a disconnected 
graph will occur when there are two components, each complete subgraphs.) 


Show that an -vertex graph cannot be a bipartite graph if it has more 47° 
edges. 


(Famous Ramsey Theory problem) Let each edge of a complete graph 
on 6 vertices be painted red or white. Show that there must always be 
either a red triangle of 3 edges or a white triangle of 3 edges. 


(a) Find a graph that is isomorphic to its own complement. 
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(b) Show that any self-complementary graph [as in part (a)] must have 
either 4k or 4k + 1 vertices, for some integer k. (Hint: Use the fact 
that G and G both must have the same number of edges.) 


. Suppose that each path in a certain 7-vertex planar graph contains an even 


number of edges (0 edges or 2 edges or 4 edges, etc.). Draw the graph. 
(Hint: This is a ‘‘trick’’ problem.) 


. Show that a directed graph has no directed circuits if and only if its vertices 


can be indexed x;, %5,..., %, So that all edges are of the form (x;; x), i 
a F 


. (a) A clique is a complete subgraph not contained as a subset of a larger 


complete subgraph. Find all cliques in the following interval graphs: 
(i) Figure 1.3 (ii) Figure 1.7a 

(b) Show that the cliques in an interval graph correspond to points of 
maximal overlap in an interval model and that when cliques are indexed 
according to the order of the points in an interval model, each vertex 
will be in a consecutive set of cliques. 


. A line graph L(G) of a graph G has a vertex of L(G) for each edge in G 


and an edge between 2 vertices in L(G) corresponding to 2 edges of G 
with a common end vertex. 


(a) Draw a line graph of the left graph in Figure 1.10. 
(b) Show that each vertex in L(K,) has degree 2(n — 2). 
(c) Find all graphs that are isomorphic to their own line graph. 


. Show that if a graph H is the line graph (see Exercise 27) of some graph, 


then the edges of H can be partitioned into a collection of complete sub- 
graphs such that each vertex of H is in exactly two such complete subgraphs. 


. A dominating set is a set of vertices that collectively are adjacent to all 


other vertices in a graph. Draw a graph whose smallest dominating set is 
not an independent set (independent sets were defined in Example 5 of 
Section 1.1). 


. An automorphism of a graph is an isomorphism (1 — 1 mapping preserving 


adjacency) of the vertices of a graph with themselves. Find all automor- 
phisms of the graph in 


(a) Figure 1.la (b) Figure 1.5 (c) Figure 1.12 


. At a complicated intersection with special signals for left turns and so 


forth, each particular signal has an associated arc on the signal timing cycle 
when the signal is green. A general approach to constructing a timing cycle 
is first to make a graph with a vertex for each signal and an edge between 
two vertices (signals) that control noncrossing streams of traffic. Then we 
try to build a family of arcs on a timing cycle such that if two arcs overlap 
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then they must represent adjacent vertices. Build the graph and some proper 
arc model for this intersection. 


32. (a) Show that there is no way to pair off the 14 vertices in the graph below 
with 7 edges. 


(b) Generalize part (a) to the problem of trying to use 31 dominoes to 
cover the 62 squares of an 8 X 8 chessboard with two opposite corner 
squares removed. 


33. A round-robin tournament can be represented by a complete directed graph 
with vertices for competitors and an edge (a; b) if a beats b. Each competitor 
plays every other competitor once. A vertex’s (competitor’s) score will be 
its out-degree (number of victories). Show that if vertex x has a maximum 
score among vertices in a round-robin tournament, then for any other vertex 
y, either there is an edge (x; y) or for some w there are edges (x; w) and 
(w, y). 

34, Suppose the round-robin tournament graph (see Exercise 33) has no directed 
circuits. We define a ranking of vertices (competitors) as follows. A vertex 
with no outward edge has a rank of 0. In general, a vertex has rank k if it 
has an edge directed to a rank k—1 vertex and all other edges directed to 
lower ranks (= k—1). Show that a directed complete graph with no directed 
circuits always has such a ranking and that each vertex will have a different 
rank. 


35. Suppose circuits C, and C, have common edges (but C, # C,). Show that 
the edges in (C,UC,) — (C{\NC,) form a circuit (or collection of circuits). 


36. If the graph G has 2n vertices and no triangles, then show that G cannot 
have more than n? edges. 


— ey 


CHAPTER 2 


COVERING CIRCUITS 
AND GRAPH COLORING 


2.1 EULER CYCLES 


In this chapter we examine two graph-theoretic concepts that have many 
important applications. One is covering edge sequences, which traverse all the 
edges in a graph once or which visit all the vertices once. The other is graph 
coloring, a concept introduced in Example | of Section 1.4. 

Recall that a circuit cannot visit a vertex more than once. We need to 
define a new term for a circuit-like sequence of edges that may repeat a vertex. 
A cycle is a sequence of consecutively linked edges ((X1, x2), (%2, 3), - +», Qn-1 
X,)) Whose starting vertex is the ending vertex (x, = x,) and in which no edge 
can appear more than once. A vertex can be visited any number of times in a 
cycle. An Euler cycle is a cycle that contains all the edges in a graph (and 
visits each vertex at least once). 

In some applications of Euler cycles, such as a highway network linking 
a group of cities, it is natural to permit several edges to join the same pair of 
vertices. Such generalized graphs are called multigraphs; loop edges, of the 
form (a, a), are also allowed in multigraphs. The graph in Figure 2.1 is a 
multigraph. 


Example 1 


The old Prussian city of Konigsberg was located on the banks of the Pregel 
River. Part of the city was on two islands that were joined to the banks and 


A 


(a) (b) 
Figure 2.1 
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each other by seven bridges, as shown in Figure 2.1a. The townspeople liked 
to take walks, or Spaziergangen, about the town across the bridges. Several 
people were apparently bothered by the fact that no one could figure out a walk 
that crossed each bridge just once, for they brought this problem to the attention 
of the famous mathematician Leonhard Euler. He solved the Spaziergangen 
problem, thereby giving birth to graph theory and immortalizing the Seven 
Bridges of Konigsberg in mathematics texts. 

We can model this walk problem with a multigraph having a vertex for 
each body of land and an edge for each bridge. See Figure 2.1b. The desired 
type of walk corresponds to what we now call an Euler cycle. Readers should 
convince themselves that no Euler cycle exists for this multigraph. « 


A multigraph possessing an Euler cycle will have to have an even degree 
at each vertex, since each time the cycle passes through a vertex it uses two 
edges. A second obvious property is that the multigraph must be connected. 
Euler showed that these two properties are also sufficient to guarantee the 
existence of an Euler cycle. We will prove this theorem for undirected 
multigraphs. Let us first build an Euler cycle by ad hoc methods in a graph 
that is connected and has even-degree vertices. Then we extend our construction 
to a general proof of the Euler cycle theorem. 


Example 2 


Build an Euler cycle for the graph in Figure 2.2a@ that is connected and has 
even-degree vertices. 

Let us start by blindly tracing out a trail from vertex a. Suppose we go a- 
d-j-n-o-k-l-h-f-e-b-a. Now we are back to a. Note that the even degree property 
means that we can always leave any vertex we enter except a. There are no 
vertices of degree 1 or that are reduced to degree 1 after being visited several 
times. Thus, any trail we trace from a will never be forced to end at another 
vertex and so must eventually come back to a, forming a cycle; in this case 
the cycle is a circuit (no vertex is visited twice). 

Next consider the graph of remaining edges, shown in Figure 2.2b. It is 
no longer connected, but all vertices still have even degree (removing the cycle 


c d e & 
[| | 
i J k 

(b) 


Figure 2.2 
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reduced each degree by an even amount). Each connected piece of the remaining 
graph has an Euler cycle: d-c-i-j-k-e-d and h-g-m-h. These two cycles can be 
inserted into the original cycle at vertices d and A, respectively, yielding the 
Euler cycle a-d-c-i-j-k-e-d-j-n-o-k-l-h-g-m-h-f-e-b-a. a 


Theorem 


An undirected multigraph has an Euler cycle if and only if it is connected and 
has all vertices of even degree. 


Proof 


The proof generalizes the construction in Example 2. As noted above, when 
an Euler cycle exists, the multigraph must be connected and have all vertices 
of even degree. 

Suppose a multigraph G satisfies these two conditions. Then pick any vertex 
a and trace out a trail. As in Example 2, the even degree condition means that 
we are never forced to stop at some other vertex, and so eventually the trail 
must terminate at a (possibly it passes through a several times before finally 
stopping at a). Let C be the cycle thus generated and let G’ be the multigraph 
consisting of the remaining edges of G-C. As in Example 2, G’ may not be 
connected but it must have all vertices of even degree. 

Since the original graph was connected, C and G’ must have a common 
vertex or else there is no path from vertices in C to vertices in G’. Let a’ be 
such a common vertex. Now build a cycle C’ tracing through G’ from a’ just 
as C was traced in G from a. Incorporate C’ into the cycle C at a’, as done in 
Example 2, to obtain a new larger cycle C’. Repeat this process by tracing a 
cycle in the graph G” of still remaining edges and incorporate the cycle into 
C’ to obtain C”. Continue until there are no remaining edges. @ 


Let us now consider an application of Euler cycles to an urban systems 
analysis problem. ; 


Example 3 


Suppose the graph of solid edges in Figure 2.3 represents a collection of blocks 
to be swept by a street sweeper in a certain district of some city from 10 aM. 
to 11 a.m. (when parking on these blocks is forbidden). The looping edge at k 
represents a circle. We want a tour that sweeps each solid edge once. That is, 
we want an Euler cycle of the solid edges. Unfortunately, the graph of edges 
to be swept in such applications rarely has all vertices of even degree. Frequently 
the graph is also not connected. In such cases we must traverse extra edges, 
called deadheading edges, to obtain the desired tour. This creates anew problem: 
minimizing the number of deadheading edges. 
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Figure 2.3 


The desired tour will be an Euler cycle of the multigraph of sweeping and 
deadheading edges—a multigraph because some edges may be repeated in 
deadheading. By the theorem, this multigraph must be connected and have all 
vertices of even degree. Thus to minimize deadheading, the deadheading edges 
should be a minimal set of extra edges that when added to the original graph 
make all vertices of even degree and the new multigraph connected. Suppose 
the dashed edges in Figure 2.3 are such a minimal set of extra edges. 

There is one more problem to be faced once we have a multigraph possessing 
an Euler cycle. We want to build an Euler cycle that minimizes turns at corners, 
especially U-turns. Even right-hand turns can tie up traffic in busy cities. We 
can successively look at each corner (vertex) and pair off the edges at the 
corer so as to minimize disruption on the tour’s passes through that corner. 
For the multigraph in Figure 2.3, we would go straight through corners d, e, 
h, i on visits to these corners; we would go straight through j once and turn 
once [going between (f, /) and (, /)]; we must turn both times at k; and at all 
other corners (of degree 2) we would have forced turns. The short lines near 
each vertex in Figure 2.3 indicate these edge pairings. Of course, such edge 
pairings are very unlikely to produce a single Euler cycle. In Figure 2.3, we 
get two cycles from these pairings: a-b-e-i-m-I-h-d-a and c-d-e-f-j-k-k-j-i-h-g- 
c. Now we break the two cycles at a common vertex and fuse them together. 
For example, at d, change the edge pairings to 


The result is an Euler cycle with optimal pairings at all but one corner. 

A large street network with 200 edges typically forms only three or four 
cycles when minimal-disruption edge pairing is performed at each corner, and 
so only a few pairings need to be changed. In practice, it is also necessary to 
use directed graphs since street sweepers cannot move against the traffic on 
their side (curb) of the street. = 
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It is interesting to note that the street sweeping problem gives rise to an 
alternative proof of the Euler cycle theorem: 


Alternative Proof of Theorem 


The even-degree condition means that we can pair off (and link together) the 
edges at each vertex. These linked edges form a set of cycles. The connectedness 
condition means that these cycles have common vertices where they can be 
joined to form a single cycle—an Euler cycle. 


This proof is not as intuitive or pictorial as our original path-tracing method, 
but it is both simpler and more applicable. 

We conclude this section by extending the concept of an Euler cycle to an 
Euler trail. A trail is a sequence of consecutively linked edges in which no 
edge can appear more than once. A trail bears the same relation to a path that 
a cycle bears to a circuit. Both allow repeated vertices. Note that a cycle is a 
trail, but most trails have different starting and ending vertices. An Euler trail 
is a trail that contains all the edges in a graph (and visits each vertex at least 
once). 


Corollary 


A multigraph has an Euler trail, but not an Euler cycle, if and only if it is 
connected and has exactly two vertices of odd degree. 


Proof 


Suppose a multigraph G has an Euler trail but not an Euler cycle. Call it T. 
Then the starting and ending vertices of T must have odd degree while all other 
vertices have even degree (by the same reasoning that showed all vertices have 
even degree in an Euler cycle). Also the graph must be connected. 

On the other hand, suppose that a multigraph G is connected and has 
exactly two vertices, p and q, of odd degree. Add a supplementary edge (p, 
q) to G to obtain the graph G’. G’ is connected and has all vertices of even 
degree. Hence by the Euler cycle theorem, G’ has an Euler cycle, call it C. 
Now remove the edge (p, g) from C. This removal reduces the Euler cycle to 
an Euler trail that includes all edges of G. 


2.1 EXERCISES 


Me Summary of Exercises The first four exercises involve trying 


to build Euler cycles. Exercises 5~8 and 11-14 present extensions and other 
questions related to the Euler cycle theorem. The remaining questions involve 
some modeling and further theory, and the last two exercises ask for computer 
programs. 
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. (a) Build an Euler cycle for the right graph in Figure 1.12. 


(b) Build an Euler trail for the graph in Figure 1.26a with edge (d, g) 
removed. 


. (a) For which values of n does K,, the complete graph on n vertices, have 


an Euler cycle? 
(b) Are there any K, that have Euler trails but not Euler cycles? 


. Suppose a new artificial island were placed in the Pregel River in between 


islands B and C in Figure 2.1a. If the bridge from B to C is removed and 
several bridges built connecting the new island to all other bodies of land 
(any number of bridges to other bodies of land allowed), is it possible to 
have a Spaziergangen that is an Euler cycle? 


. What is the minimum number of times one must raise one’s pencil in order 


to draw the graph in Figure 1.5? 


. (a) Cana graph with an Euler cycle have a bridge (an edge whose removal 


disconnects the graph)? Prove or give a counterexample. 
(b) Give an example of a 10-edge graph with an Euler trail that has a 
bridge. 


Give an argument for and an argument against the statement that a 1-vertex 
graph (with no edges) has an Euler cycle. 


. Suppose that in the definition of an Euler cycle, we drop the seemingly 


superfluous requirement that the Euler cycle visit every vertex and require 
only that the cycle include every edge. Show that now the theorem is false. 
Draw a graph that illustrates why the theorem is now false. 


_ Prove that if a connected graph has a 2k vertices of odd degree, then there 


are k disjoint trails that contain all the edges. 


. The matrix below marks with a 1 each pair of the set of racers A, B, C, 


D, E, F who are to have a drag race together. It is most efficient if a racer 
can run in two races in a row (but not three in a row). Is it possible to 
design a sequence of races such that one of the racers in each race (except 
the last race) also runs in the following race (but not three in a row)? If 
possible, give the sequences of races (pairs of racers); if not, explain why 


10. 


16. 
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Is it possible for a knight to move around an 8 by 8 chessboard so that it 
makes every possible move exactly once (consider a move between two 
squares connected by a knight to be completed when the move is made in 
either direction)? 


. Show that in Example 3 the minimum set of deadheading edges in any 


sweeping problem will be a collection of edges forming paths between 
pairs of different odd-degree vertices. 


. Prove the directed version of the Euler Cycle Theorem: A directed 


multigraph has a directed Euler cycle if and only if the multigraph is 
connected (when directions are ignored) and the in-degree equals the out- 
degree at each vertex. 


(a) Model your proof after the argument in the proof of the theorem. 


(b) Model your proof after the argument in the alternate proof of the 
theorem. 


(c) Build a directed Euler cycle for the graph below. 


. State and prove a directed multigraph version of the corollary. 
. A directed graph is called strongly connected if there is a directed path 


from any given vertex to any other vertex. Show that if a directed graph 
possesses a directed Euler cycle, then it must be strongly connected. 


- Try to find a minimal set of edges in the graph below whose removal 


produces an Euler cycle. (Hint: Tricky.) 


The line graph L(G) of a graph G has a vertex for each edge of G, and 
two of these vertices are adjacent if and only if the corresponding edges 
in G have a common end vertex. 
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17. 


18. 


19. 


20. 


(a) Show that L(G) has an Euler cycle if G has an Euler cycle. 


(b) Find a graph G that has no Euler cycle but for which L(G) has an 
Euler cycle. 


Consider the following algorithm due to Fleury for building an Euler cycle, 
when one exists, in a single pass through a graph (without later adding 
side cycles as in the proof of the theorem). Starting at a chosen vertex a, 
build a cycle and erase edges after they are used (also erase vertices when 
they become isolated points). The one rule to follow in the cycle building 
is that one never chooses an edge whose erasure will disconnect the resulting 
graph of remaining edges. 


(a) Apply this algorithm to build Euler cycles for the graph in: 

(i) Figure 2.2a (ii) Figure 2.3 (including deadheading edges) 
(b) Prove that this algorithm works. 
(c) Does this algorithm work for Euler trails? Explain. 


Suppose we are given an undirected connected graph representing a network 
of two-way streets. 


(a) Show that there always exists a tour of the network in which a person 
drives along each side of every street once. 


(b) Show that the tour in part (a) can be generated by the following rule: 
At any intersection do not leave by the street first used to reach this 
intersection unless all other streets from the intersection have been 
used, 


A set of 8 binary digits (0 or 1) are equally spaced about the edge of a 
disk, We want to choose the digits so that they form a circular sequence 
in which every subsequence of length three is different. Model this problem 
with a graph with 4 vertices, one for each different subsequence of two 
binary digits. Make a directed edge for each subsequence of three digits 
whose origin is the vertex with the first two digits of the edge’s subsequence 
and whose terminus is the vertex with the last two digits of the edge’s 
subsequence. 


(a) Build this graph. 

(b) Show how an Euler cycle (which exists for this graph) will correspond 
to the desired 8-digit circular sequence. 

(c) Find such a 8-digit circular sequence with this graph model. 

(d) Repeat the problem for 4-digit binary sequences. 

Write computer programs for finding an Euler cycle, when one exists, in 

a multigraph: 

(a) Use the method in the proof of the theorem. 

(b) Repeat part (a) for a directed graph. 

(c) Use the method in the alternate proof of the theorem. 
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(d) Modify the program in parts (a) or (c) to build the k trails described 
in Exercise 8. 


21. Write a program to implement the algorithm in Exercise 17. 


2.2 HAMILTON CIRCUITS 


In this section we explore Hamilton circuits and paths, circuits and paths that 
visit each vertex in a graph exactly once. Hamilton circuits arise in operations 
research problems involving the route of a delivery truck that must supply a 
set of stores or motion planning for a drill press making holes at specified 
locations on printed circuit boards. In these applications, the most efficient 
solution will be obtained by finding a minimal-cost Hamilton circuit (this 
problem is discussed in Section 3.4). 

The problem of determining whether a graph has an Euler cycle was 
answered by the Euler cycle theorem, which tells us that a graph has an Euler 
cycle if and only if all vertices have even degree and the graph is connected. 
Such a nice, simple answer is very unusual in graph theory (this is probably 
why the Euler cycle theorem was the first result proved in the field of graph 
theory). In this section, we return to graph theory normalcy: There is no simple 
way to determine whether or not an arbitrary graph has a Hamilton circuit (or 
a Hamilton path). 

Finding Hamilton circuits by inspection, when they exist, is usually not 
too hard in moderate-sized graphs; but proving that no Hamilton circuit exists 
in a given graph can be very difficult. Such a proof typically involves the same 
type of reasoning needed to show that two graphs are not isomorphic or that 
a graph is nonplanar. At the end of this section we present a sampling of the 
theory that has been developed about the existence of Hamilton circuits. These 
results only give special conditions on a graph that guarantee the existence of 
a Hamilton circuit. Most graphs satisfy none of these conditions. 

Our focus will be proving that a Hamilton circuit does not exist in particular 
graphs. This nonexistence problem requires the type of systematic logical analy- 
sis that is the essence of most applied graph theory. To prove nonexistence, 
we must begin building parts of a Hamilton circuit and then show that the 
construction must always fail. This is similar to the way in Section 1.4 that we 
showed a graph to be nonplanar by building a circuit and then adding chords 
in a structured fashion that forced two edges to cross. 

Our analysis is built on three simple rules that must be satisfied by the 
set of edges forming a Hamilton circuit. The idea underlying these rules is 
that any Hamilton circuit must contain exactly two edges incident to each 
vertex. 
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Rule 1. Ifa vertex x has degree 2, both of the edges incident to x 
must be part of any Hamilton circuit. 


Rule 2. No proper subcircuit, that is, a circuit not containing all 
vertices, can be formed when building a Hamilton circuit. 


Rule 3. Once the Hamilton circuit is required to use two edges at a 
vertex x, all other (unused) edges incident at x can be deleted. 


Example 1 


Show that the graph in Figure 2.4 has no Hamilton circuit. 

We can apply Rule 1 at vertices a, b, d, and e. To indicate that the two 
edges at each of these vertices must be used, we draw a little line segment 
from one edge to the other, as shown in Figure 2.4. There are two types of 
contradictions that have now arisen. First, using Rule 1 for vertices a and d 
forces the Hamilton circuit to use edges (a, c), (a, d), (d, c) in forming a 
triangle; this violates Rule 2. Second, using Rule 1 for all four vertices of 
degree 2 forces the Hamilton circuit to contain more than two edges incident 
at c. Either of these difficulties proves that this graph has no Hamilton circuit. = 


In the following two examples, our three rules will be used in the following 
sort of sequential reasoning. When Rule 3 requires us to delete edges at some 
vertex g, one of q’s deleted edges may go to a vertex r of degree 3. After the 
deletion of (g, r), r has degree 2, requiring by Rule | that its remaining two 
edges be used. Possibly, one of these two edges, say (r, 5), Causes two edges 
to be used at neighboring vertex s, and so now Rule 3 can be invoked at s, 
continuing the sequence of forced moves. 


Example 2 


Show that the graph in Figure 2.5 has no Hamilton circuit. 

We can apply Rule 1 at vertices a and g, so that the subpaths b-a-c and 
e-g-i must be part of any Hamilton circuit. Next consider vertex 1. We already 
know that (g, i) must be part of the circuit. Since the graph is symmetric with 
respect to edges (i, j) and (i, k), it does not matter which of these two edges 
we choose as the other edge incident to i on the Hamilton circuit. Suppose we 
pick (i, j). If we obtain a contradiction using (7, j), then by symmetry we would 
also obtain a contradiction with (i, k). The situation can be visualized as follows: 


a b 


eee 


Figure 2.4 4 e 
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If we marked (i, j) and then held the paper with the graph in front of us and 
viewed it in a mirror, it would appear that we had chosen (i, k); every subsequent 
move starting with (i, 7) would have a mirror image move starting from (i, k). 

Having chosen to use (i, j) as the second edge at i, we delete the other 
edge at i, (i, k), by Rule 3. See Figure 2.5. Each time we apply a rule, we need 
to check how other edges and vertices are affected. Does any vertex now have 
only two remaining edges at it, allowing Rule 1 to be applied? Or are two 
edges now required to be used at some vertex, allowing Rule 3 to be applied? 
Also, is there some edge that, if used, would complete a subcircuit, allowing 
Rule 2 to be used to delete it? 

Deleting (7, k) reduces the degree of k to 2, and so Rule 1 requires that we 
use both remaining edges incident at k, (j, k) and (h, k). Edge (j, k) is the 
second edge used that is incident to j. Then by Rule 3, j’s other edge (f, /) 
must be deleted. This deletion reduces the degree of f to 2. See Figure 2.5. So 
we must use the two remaining edges at f, (b, f) and (f, e). Edge (f, e) is the 
second edge used at e, and so e’s other edges, (e, d) and (e, A), must be deleted. 
Vertex h now has degree 2 forcing the use of edge (c, h). 

Now we run into some contradictions. Looking at d, which now has degree 
2, we must use edges (b, d) and (c, d), but b and c each already have two 
edges—thus violating Rule 3. Also, we have formed the subcircuit a-b-d-c-a, 
violating Rule 2. Or if we apply Rule 1 at b and c, deleting edges (b, d) 
and (c, d), then d is left with no edges—clearly impossible. Any of these 
inconsistencies proves that this graph has no Hamilton circuit. 

Note that this graph does have a Hamilton path, for example, a-b-f-e-g-i- 
J-k-h-c-d. = 


It should be emphasized that the following is not valid reasoning to show 
that a graph has no Hamilton circuit: Start from some vertex and construct a 
route visiting successive vertices and show that several attempts to find a circuit 
through all vertices fail. Even in a simple graph like the one in Figure 2.5, 
there are hundreds of possible beginning subpaths for a Hamilton circuit that 
would all have to be checked. The approach presented here requires much less 
work than a rigorous trial-and-error effort. 
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Example 3 


Show that the graph in Figure 2.6 has no Hamilton circuit. 

Note that this graph has vertical and horizontal symmetry (although vertex 
nis off to one side, its adjacencies have a square-like symmetry). It sometimes 
takes a bit of trial-and-error experimenting to find a good vertex at which to 
start trying to build a Hamilton circuit when there are no vertices of degree 2. 
We seek a vertex with the property that once two edges are chosen at the 
vertex, then the use of Rules 1 and 3 will force the successive deletion and 
inclusion of many edges. Vertex e is such a vertex. We can either use two 
edges incident at e from opposite sides (180° apart) or use two edges incident 
at e that form a 90° angle. We must examine both cases to show that no 
Hamilton circuit can exist. 


Case I Suppose we use two edges incident at e from opposite sides. By 
symmetry, they can either be edges from d and f or from b and h. Suppose we 
choose (d, e) and (e, f). Then by Rule 3, we can delete edges (e, b) and 
(e, h). Then at b and at h we must use both remaining edges, getting subpaths 
a-b-c and g-h-i. Now at d we can use either edge (d, a) or edge (d, g). The 
two cases are symmetrical with respect to the edges chosen for the circuit thus 
far. So without loss of generality, we can choose edge (d, a). 

At f, we cannot use (f, c) or else subcircuit a-b-c-f-e-d-a results. So we 
must use (f, i). See Figure 2.6. Since we have used two edges at vertices a 
and i, the other edges at these vertices can be deleted by Rule 3. We now 
obtain several inconsistencies. Vertices j, k, 1, and m each now have degree 2. 
But each is incident to n, and so using the two remaining edges at each causes 
4 edges to be used at n. We conclude that there is no Hamilton circuit in 
Case I. 


Case II Suppose we use two edges incident at e that form a 90° angle. By 
symmetry it does not matter which of the four 90° angle pairs of edges we 
choose. Suppose we choose (b, e) and (d, e). See Figure 2.7. Then by Rule 3 
we can delete edges (e, f) and (e, h). Then at fand h we must use both remaining 


Figure 2.6 
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Figure 2.7 


edges getting subpaths c-f-i and g-h-i. See Figure 2.7. By Rule 3 at i, we can 
delete edges (i, k) and (i, /). By Rule 1, we must use the remaining edges at 
k and J, forming the subcircuit i-f-c-k-n-l-g-h-i. 

Having obtained contradictions in both cases, we have proved that the 
graph has no Hamilton circuit. =» 


We now present a few of the theoretical results about the existence of 
Hamilton circuits and paths. (See [4] for proofs of Theorems 1, 2, and 3.) 


Theorem 1 


A connected graph with n vertices, n > 2, has a Hamilton circuit if the degree 
of each vertex is at least n/2. 


Theorem 2 


Let G be a connected graph with n vertices, and let the vertices be indexed x,, 
Xz, ...,X,, SO that deg(x,) = deg(x;.,). If for each k = n/2, either deg(x,) > k 
or deg(x,-,) = n—k, then G has a Hamilton circuit. 


Theorem 3 


Suppose a planar graph G has a Hamilton circuit H. Let G be drawn with any 
planar depiction, and let r; denote the number of regions inside the Hamilton 
circuit bounded by i edges in this depiction. Let r/ be the number of regions 
outside the circuit bounded by i edges. Then the numbers 7; and r} satisfy the 
equation 


> @~ 27; - r/) =0 e) 


Just as the inequality e = 3v — 6 for planar graphs in the corollary 
in Section 1.4 could be used to prove that some graphs are not planar, 
Theorem 3 can be used to show that some planar graphs cannot have Hamilton 
circuits. 
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Example 4 


Show that the planar graph in Figure 2.8 has no Hamilton circuit. 

We have indicated the number of bounding edges inside each of the regions 
in the planar depiction of the graph in Figure 2.8. There are three regions with 
four edges and six regions with six edges. Thus, no matter where a Hamilton 
circuit is drawn (if it exists), we know that r, + r, = 3 andre + r¢ = 6. 
Observe that for this graph, Eq. (*) reduces to 


2r, — 74) + 40r6 — 16) = 0 


We cannot have rs — ré = 0, that is, ro = r, = 3, for then Eq. (*) would 
require ry, — 14 = Oorry = rj,—-which is impossible since r, + r, = 3. Tf re 
— r! + 0, then |r, — ré| = 2 and so \4(r, — r4)| = 8. Now it is impossible to 
satisfy Eq. (*) since even if r, = 3, 1 = 0 (or m4 = 0, ri = 3), |2(ra — 1] = 
6. Thus, it is impossible for Eq. (*) to be valid for this graph, and so no 
Hamilton circuit can exist. = 


We next present a theorem involving directed graphs and Hamilton paths. 
A tournament is a directed graph obtained from a complete (undirected) graph 
by giving a direction to each edge. 


Theorem 4 
Every tournament has a Hamilton path. 


Proof 


The proof is by induction. For a 2-vertex tournament, a directed Hamilton path 
trivially exists. Next assume by induction that any tournament with n — | 
vertices, for n = 3, has a directed Hamilton path and let us prove that a n- 
vertex tournament G has a directed Hamilton path. 

Remove a vertex z from G, leaving a tournament G’ with n — 1 vertices. 
By the induction assumption, G’ has a Hamilton path H = (x), X2, %3, +++ Xn-1). 
If the edge between z and x, is (z; x), then z can be added to the front of H 
to obtain a Hamilton path for G. Similarly, if the edge between z and X,-) is 
(X,-1, Z), then z can be added to the end of H to obtain a Hamilton path for G. 


Figure 2.8 i 


2.2. Hamilton Circuits 71 


So assume the edge from the first vertex x, of H points toward z and the edge 
from the last vertex x,-, of H points from z. Then for some consecutive pair 
on H, x;-;, x;, the edge direction must change, that is, we have edges (x,_;; z) 
and (z; x;). We can insert z between x;_, and x; in H to obtain a Hamilton path 
(X1, Xo, 666s Xinty 2) Xjp 0 0 ey Xp). @ 


We conclude this section with an application of Hamilton paths to a problem 
in coding theory. 


Example 5 


When a spacecraft is sent to distant planets and transmits pictures back to earth, 
these pictures are transmitted as a long sequence of numbers, each number 
being a darkness value for one of the dots in the picture. For simplicity, assume 
the darkness numbers range between | and 8 . These numbers are actually sent 
as a sequence of Os and 1s. A straightforward encoding scheme would be to 
express each number in its binary representation, that is, 1 as 001, 2 as 010, 3 
as O11, and so on, ending with 8 as 000. 

However, a better scheme, called a Gray code, uses an encoding with the 
property that two consecutive numbers are encoded by binary sequences that 
are almost the same, differing in just one position. For example, a fragment 
of a Gray code might be 4 as 010, 5 as 011, 6 as 001. The advantage of such 
an encoding is that if an error from ‘‘cosmic static’? causes one binary digit 
in a sequence to be misread at a receiving station at Earth, then the mistaken 
sequence will often be interpreted as a darkness number that is almost the same 
as the true darkness number. For example, in the preceding fragment of a Gray 
code, if 011 (5) were transmitted and an error in the last position caused 010 
(4) to be received, the resulting small change in darkness would not seriously 
affect the picture. (Of course, some errors will cause substantial inaccuracies). 

With this background, we now translate the problem of finding a Gray 
code for the 8 darkness numbers into the problem of finding a Hamilton circuit 
in a graph. We define the graph as follows: Each vertex corresponds to a 
3-digit binary sequence, and two vertices are adjacent if their binary sequences 
differ in just one place. The graph is shown in Figure 2.9a. Observe that it is 
a cube (the binary sequences can be thought of as the coordinates of a cube 
drawn in three dimensions). A similar graph can be drawn for the 16 4-digit 
binary sequences, or for any given n, the 2” n-digit binary sequences. 

We claim that the order in which vertices (binary sequences) occur along 
a Hamilton path in this graph produces a Gray code. That is, 1 is encoded as 
the first binary sequence (vertex) in a given Hamilton path, 2 is encoded as 
the second binary sequence, and so on. This process yields a Gray code because 
consecutive vertices in the Hamilton path, which encode consecutive darkness 
numbers, will correspond to binary sequences that differ in just one position. 
Figure 2.96 illustrates how a Hamilton path in the graph produces a Gray 
code. = 
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101 
010 
| a oa 
000 100 


(a) 


Figure 2.9 


The graph with one vertex for each n-digit binary sequence and an edge 
joining vertices that correspond to sequences that differ in just one position is 
called an n-dimensional cube, or hypercube. The graph in Figure 2.9a is a 3- 
dimensional cube (or standard cube). An n-dimensional cube has 2” vertices, 
each of degree n. It has the property that the longest path between any two 
vertices has length n. Hypercubes arise in the design of supercomputers. Ina 
parallel supercomputer with 2” processors, there are too many processors to 
permit a direct connection between every pair of processors. Instead, one 
supercomputer maker interconnects the processors with the structure of a hyper- 
cube graph. 


2.2 EXERCISES 


Summary of Exercises The first ten exercises involve the 
existence or nonexistence of Hamilton paths and circuits. Exercises 9 and 10 
introduce other potential aids for proving nonexistence. Exercises 11-17 involve 
applications of Hamilton circuits. The last five exercises involve theory. 


1. (a) Draw a graph with a Hamilton circuit but no Euler circuit. 
(b) Draw a graph with an Euler circuit but no Hamilton circuit. 


2. Find a Hamilton circuit in each of the following graphs. 


(a) (b) (c) 
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3. Find a Hamilton circuit in the following graph. 


4. Find a Hamilton path in each of the following graphs and prove that no 
Hamilton circuit exists: 


(a) 
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5. Find a Hamilton path in Figure 2.8 and show that no Hamilton circuit 
exists (using the reasoning in Examples 1, 2, 3). 


6. Show that there can be no Hamilton circuit in the following graph using 
both edges (a, f) and (c, A). 


a 
val 
d c 


7, Prove that the following graphs have no Hamilton circuits: 


(a) 


8. Use Theorem 3 to show that the following planar graphs have no Hamilton 
circuit: 
(a) Exercise 4(a) (b) Exercise 4(b) (c) Exercise 4(i) 


9. 


11. 
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Recall from Example | in Section 1.1 that a graph is bipartite if the vertices 
can be divided into two sets, for convenience call them blue vertices and 
red vertices, such that every edge connects a blue and a red vertex. 


(a) Show that if a connected bipartite graph has a Hamilton circuit, then 
the numbers of red and blue vertices must be equal. Further, if a 
bipartite graph has an odd number of vertices, then it has no Hamilton 
circuit. 


(b) Show that if a connected bipartite graph has a Hamilton path, then the 
numbers of reds and blues can differ by at most one. 


(c) Use part (a) to show that the following graphs have no Hamilton circuit: 
(i) Figure 2.8 (ii) Exercise 4(f) (iii) Exercise 7(b). 


. Suppose a set J of k vertices in a graph G is chosen so that no pair of 


vertices in / are adjacent. Then for each x in J, deg(x) — 2 of the edges 

incident to x will not be used in a Hamilton circuit. Summing over all 

vertices in J, we have e’ = Y,<, (deg(x)—2) = {X,e, (deg(x)} — 2k edges 

that cannot be used in a Hamilton circuit. 

(a) Let v and e be the numbers of vertices and edges in G, respectively. 
Show that if e — e’ < v, then G can have no Hamilton circuit. 


(b) Why is the claim in part (a) valid only when / is a set of nonadjacent 
vertices? 

(c) With a suitably chosen set J, use part (a) to show that the following 
graphs have no Hamilton circuits: 
(i) Figure 2.6 (ii) Exercise 4(i) (iii) Exercise 7(b) 

(a) Draw a 4-dimensional hypercube graph. 


(b) Use the graph in part (a) to find a Gray code for encoding the numbers 
1 through 16 as 4-bit binary sequences. 


. Show without citing any theorems stated in this section that any 6-vertex, 


undirected graph with all vertices of degree 3 has a Hamilton circuit. 


. Find a path of knight’s moves visiting all squares exactly once on an 8 X 


8 chessboard. 


. Suppose a classroom has 25 students seated in desks in a square 5 X 5 


array. The teacher wants to alter the seating by having every student move 
to an adjacent seat (just ahead, just behind, on the left, or on the right). 
Show that such a move is impossible. 


- (a) Describe how to construct a circuit including all squares of ann X n 


chessboard, n even, using a rook. Using a king. 
(b) Repeat part (a) for n odd. 


. Consider 27 little cubes arranged in a 3 by 3 by 3 array (as in Rubik’s 


Cube). Form an associated graph with 27 vertices, one for each little cube, 
and with two vertices adjacent if they have touching faces (not just edges). 
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Does this graph have a Hamilton path starting at the vertex corresponding 

to the middle inside cube and ending at one of the vertices corresponding 

to a corner cube? 

17. (a) How many different Hamilton circuits are there in K,, a complete 

graph on n vertices? 

(b) Show that K,, prime = 3, can have its edges partitioned into a(n — 
1) disjoint Hamilton circuits. 

(c) If 17 professors dine together at a circular table during a conference, 
and if each night each professor sits next to a pair of different professors, 
how many days can the conference last? 


18. (a) If a graph G has an Euler cycle, show that L(G), the line graph of G 
(see Exercise 16 of Section 2.1 for the definition of a line graph), has 
a Hamilton circuit. 
(b) If G has a Hamilton circuit, show that L(G) has a Hamilton circuit. 
(c) Show that the converses of parts (a) and (b) are false by finding counter- 
examples. 


19. Show that if G is not a complete graph, then it is possible to direct the 
edges of G so that there is no directed Hamilton path. 


Show that in a tournament (defined preceding Theorem 4) it is always 

possible to rank the contestants so that the person ranked ith beats the 

person ranked (i+1)st. (Hint: Use Theorem 4.) 

21. Show that Theorem 1 is false if the requirement of degree = gn is relaxed 
to just = 3(n—1). 

22. (a) Prove for n = 3 that an undirected graph with n vertices and at least 


20 


CS) + 2 edges must have a Hamilton circuit. 


(b) Show that part (a) is false if there are only & ') + 1 edges. 


—____—/ 


2.3 GRAPH COLORING 


In Example 1 of Section 1.4 we introduced the problem of map coloring— 
coloring the countries of a map so that two countries with a common border 
are assigned different colors. The problem of showing that any map can be 4- 
colored tantalized mathematicians for 100 years until a computer-assisted proof 
was obtained by Appel and Haken in 1976. More recently, graph coloring has 
been applied to a variety of problems in computer science, operations research, 
and design of experiments. Recall that coloring countries in a map is equivalent 
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to coloring vertices, with adjacent vertices getting different colors, in the dual 
graph obtained by making a vertex for each country and an edge between 
vertices representing countries with a common border; see Example 1| in Section 
1.4. In general, a coloring of a graph G assigns colors to the vertices of G so 
that adjacent vertices are given different colors. 

In this section we show how to determine the minimal number of colors 
required to color a given graph. This minimal number of colors is called the 
chromatic number of a graph. We also give some applications of graph 
coloring. In the next section we will present some theorems about graph coloring. 
In a coloring of a graph, the vertices that have a common color will be mutually 
nonadjacent (no pair is joined by an edge). In Example 5 of Section 1.1 we 
introduced the term independent set to refer to sucha set of mutually nonadjacent 
vertices. We shall revisit that example later in this section. 

For graphs with 15 or fewer vertices, it is usually not difficult to guess a 
graph’s chromatic number. To verify rigorously that the chromatic number of 
a graph is a number k, we must also show that the graph cannot be properly 
colored with k — 1 colors. Proving that a graph cannot be (k — 1)-colored is 
similar to proving that a graph has no Hamilton circuit or cannot be isomorphic 
to another particular graph. In this case, the goal is to show that any (k — 1)- 
coloring we might construct for the graph must force two adjacent vertices to 
have the same color. 


Example 1 


Find the chromatic number of the graph in Figure 2.10. 

Looking at the inner square with crossing diagonals, we see that vertices 
a, C, €, g are mutually adjacent, that is, they form a complete subgraph. They 
each require a different color in a proper coloring, four colors in all. Once four 
colors are available, it is easy to properly color the remaining vertices b, d, f, 
h. Each of them is adjacent to only two other vertices, and so at most two out 
of the four colors need ever be avoided with these vertices. Let us use the 
numbers 1, 2, 3, 4 as the ‘‘names’’ of our colors. Then one possible 4-coloring 
of the graph is shown in Figure 2.10. 

In this problem it is immediately clear that the graph cannot be 3-colored, 
since some adjacent pair of vertices in the complete subgraph formed by a, c, 


Figure 2.10 
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e, g would have the same color in a 3-coloring. So the chromatic number of 
this graph is 4. =~ 


Example 1 points up two important rules. First, a complete subgraph on k 
vertices requires k colors [cannot be (k — 1)-colored]. Second, when building 
a k-coloring of some graph, we can ignore all vertices of degree < & (and their 
incident edges), since once the other vertices are colored, there will always be 
at least one color available (not used by any adjacent vertex) to properly color 
each such vertex. 


Example 2 


Find the chromatic number of the graph in Figure 2.11. 

We note that a graph of this form is called a wheel. The largest complete 
subgraph in this graph is a triangle. Let us try to build a 3-coloring of this 
graph with ‘‘colors’’ 1, 2, and 3. We will start by coloring the vertices of a 
triangle. Suppose we choose the triangle a, b, f: Let a be 1, b be 2, and f be 
3 (the order of colors is arbitrary). See Figure 2.11. Since c is adjacent to 
vertices b and f of colors 2 and 3, respectively, c is forced to be color 1. 
Similarly, d is forced to be 2, and then e is forced to be 1. However, now the 
adjacent vertices a and e both have color 1. Thus the graph cannot be 3-colored. 
On the other hand, using a fourth color for e yields a proper coloring. So the 
chromatic number of this graph is 4. = 


Observe that in Example 2 if vertex e were missing and d were adjacent 
to a instead, then three colors would work. In general, wheel graphs with an 
even number of ‘“‘spokes’’ can be 3-colored, whereas wheels with an odd 
number of spokes require four colors. 

The key to the impossibility of finding a 3-coloring in Example 2 is the 
sequence c, d, e of forced vertex colors. In general, when attempting to build 
a k-coloring graph, it is desirable to start by k-coloring a complete subgraph 
of & vertices and then successively finding an uncolored vertex adjacent to 
vertices of k — 1 different colors, thereby forcing the color choice for this 
vertex. 


e, 1 b,2 


Figure 2.11 d,2 c 1 
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Example 3 


Find the chromatic number of the graph in Figure 2.12. 

The largest complete subgraph is again a triangle, and so we want to try 
building a 3-coloring. The only triangles whose coloring will force the color 
of another vertex are (d, e, f) and (e, f, g). Suppose we color d 1, e 2, and f 
3. Then g is forced to be 1. Now we are in trouble, since no more uncolored 
vertices are adjacent to two colors. 

Observe that b and c are both adjacent to a vertex of color 1 and are 
adjacent to each other. Thus, one of b and c must be color 2 and the other 
color 3. By the symmetry of the graph, we can assume b is 2 and c 3, although 
it actually does not matter. What is important is that one of b, c is color 2 and 
the other color 3, for this forces the color of a to be 1. Similarly A and i will 
be 2 and 3, collectively, forcing j to -be 1. But the adjacent pair a, j are both 
1. Thus the graph cannot be 3-colored. Making a or j a fourth color yields a 
proper 4-coloring, and so the graph’s chromatic number is 4. = 


Let us now look at some applications of graph coloring. We start by 
repeating Example 5 of Section 1.1, which involved the problem of scheduling 
committee meeting times. 


Example 4 


A state legislature has many committees that meet for one hour each week. 
One wants a schedule of committee meeting times that minimizes the total 
number of hours but such that two committees with overlapping membership 
do not meet at the same time. We show that this is.a graph-coloring problem. 

The key information about the committees is which committees have over- 
lapping membership. Let us create a graph with a vertex corresponding to each 
committee and with an edge joining 2 vertices if they represent committees 
with overlapping membership. For concreteness, suppose that the graph in 
Figure 2.13 represents the membership overlap of 10 legislative committees. We 
must schedule the vertices so that adjacent vertices (overlapping committees) get 
different meeting hours. A coloring of this graph performs exactly this type of 
“‘scheduling.’’ The colors will represent different meeting times. The graph in 
Figure 2.13 has a chromatic number of 4 (a minimal coloring is shown in 


Figure 2.12 € a1 i 
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Figure 2.13). Thus, four hours suffice to schedule committee meetings without 
conflict. = 


Figure 2.13 


Example 5 


In a simplified (one-dimensional) form of Very Large Scale Integrated circuit 
(VLSI) design, one has a row of logical subcircuits called gates G,, Go, ..-, 
G, with specified connections between certain pairs of these gates. These con- 
nections are laid in a set of parallel tracks, as illustrated in Figure 2.145. Two 
connections must go in different tracks if they would overlap, even if just at 
a common endpoint. ; 

What is the minimum number of tracks needed for the following six connec- 
tions among a sequence of six gates: (/, 3), (1, 3)—two copies of this connec- 
tion—(2, 3), (2, 4), (2, 5), (4, 5)? See Figure 2.14b. The set of connections 
can be treated as a family of intervals on a line. Two intervals (connections) 
can have the same track if they do not overlap. We model this problem as a 
graph-coloring problem. 

We form a graph indicating which connections overlap. Since the connec- 
tions are intervals along the line from 1 to 5, this connection graph is just a 
graph of overlapping intervals. Interval graphs were discussed in Examples 6 
and 7 in Section 1.1. We make a vertex for each interval and join two vertices 
with an edge if they represent intervals that overlap. See Figure 2.14a. Now 
let colors stand for tracks. A minimal coloring of the interval graph will corre- 
spond to a minimal track assignment. Since the interval graph contains a 
complete graph on five vertices—all vertices except (4, 5)—-five colors (tracks) 
are needed, as shown in Figure 2.14d. 


(1, 3) 

Gy G, 
(1, 3) (2, 3) 
G2 G3 
G3 G2 
(2, 5) (2, 4) Ga Ga 
(4, 5) Gs Gs 
(a) (b) (c) 


Figure 2.14 
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Frequently the order of the gates is under the designer’s control. Is it 
possible to order the gates differently so as to reduce the number of tracks in 
the previous design problem? Changing the gate order produces a different set of 
connection intervals and so may change the chromatic number of the associated 
interval graph. By inspection, we observe that with the gate order /, 3, 2, 4, 
5, only three tracks are needed. See Figure 2.14c. (No efficient procedure is 
known for finding the best gate order; a huge amount of computer time in 
VLSI design is taken up searching for the best order.) = 


Example 6 


We now consider a complex optimization problem in which graph coloring 
plays only a secondary role. There is a set of sites S; that must be serviced 
(visited) k; times each week (1 = k; = 6). We seek a minimal set of day-long 
truck tours for a week such that each site is visited on k; of the tours. In addition, 
we require that these tours can be partitioned among the six days of the week 
(Sunday is excluded) in a manner so that no site is visited twice on one day. 
This is an extremely difficult problem, which cannot be solved exactly. 

When a problem of this type (involving garbage collection) was analyzed 
for the New York City Department of Environmental Protection, an algorithm 
was used that started with an inefficient set of tours and successively tried to 
improve the set of tours. Suppose we had a simplified situation where the week 
contained just three workdays, and our algorithm had generated the set of tours 
shown in Figure 2.15a. Can these six tours be so partitioned among the three 
workdays so that no site is visited twice on the same day? Another way to 
state the constraint is: If two tours have a site in common, then the tours are 
assigned to different workdays. This is the type of constraint handled by a 
coloring model. 

Given a set of tours, we form an associated tour graph with one vertex for 
each tour and two vertices adjacent if they correspond to two tours that visit 
a common site. We assign colors, representing the different workdays, to the 


r= iwi 
fap > 


Figure 2.15 (a) (b) 
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vertices (tours) of the tour graph. In the general six-day problem, partitioning 
the tours among the six days is equivalent to 6-coloring the tour graph. In the 
tour graph in Figure 2.15b, we desire just a 3-coloring, which indeed exists. 
If the optimizing algorithm were next to combine tours A and F in Figure 2.15a 
to get a smaller set of tours, such a move would have to be blocked (and other 
optimizing moves tried instead) because the resulting tour graph would have 
a complete graph on the four vertices A, B, C, D; this would require four colors 
(days). = 


2.3. EXERCISES 


Ms Summary of Exercises Exercises 1-8 involve finding minimal 


vertex colorings and associated problems. Exercises 9-11 require minimal 
colorings of maps and a geometric array. Exercises 12-18 are color-modeling 
problems. 


1. Find the chromatic number of each of the following graphs. Give a careful 
argument to show that fewer colors will not suffice. 


(a) (b) (c) 
r VN 
(e) 


a 


wy <> 


2.3. Graph Coloring 83 


. Find a minimal edge coloring of the following graphs (color edges so that 


edges with a common end vertex receive different colors). 
(yy? 


(c) Exercise 1(h) 


(b) 


b c 
d e £ 


. A graph G is color critical if the removal of any vertex of G decreases 


the chromatic number. Which graphs in Exercise 1 are color critical? 


. Find all sets of three or more vertices that could have the same color in a 


proper coloring of the graph in: 
(a) Exercise 1(a) (b) Exercise 1(b) 


- A coloring partitions a graph G into sets of mutually nonadjacent vertices. 


In the complement G of G, this partition becomes a partition of G into 
sets of mutually adjacent vertices, that is, complete subgraphs. Find such 
a minimal set of complete subgraphs partitioning the vertices in the graph 
in: 


(a) Exercise 1(a) (b) Exercise 1(b) (c) Exercise 1(d) 


» An equitable coloring is a minimal coloring in which the numbers of 


vertices of each color differ by at most one. Which of the following graphs 
have minimal colorings that are equitable? 


(a) Exercise 1(a) (b) Exercise 1(c) 


- (a) The chromatic polynomial P,(G) of a graph G is a polynomial in k 


that gives for each positive integer k, the number P,(G) of different 
k-colorings of G. Determine P,(G) for the following graphs: 
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10. 


11. 


12. 


13. 


[rl Z424Q LA. 


@ (ii) (iii) 


(b) If we know PG), how can we use it to determine the chromatic 
number of G? 


. An edge coloring assigns colors to edges so that edges with a common 
end vertex receive different colors. Describe graphs that can be edge colored 
using just two colors. 


. Can the 50 states in a map of the United States be properly 3-colored? 
(Note that states meeting only at a corner, such as Colorado and Arizona, 
are not considered adjacent.) 


Suppose a map is made by drawing n intersecting circles. Show that the 
regions in this map can be properly 2-colored. 


(a) Solve by an inductive argument. 


(b) Solve by assigning colors based on the number of circles that contain 
a region. 


How many colors are needed to color the 15 billiard balls in this triangular 
array with touching balls different colors? 


The Applied Math Department is scheduling the times for classes for next 
semester. Each student has already decided which subset of ApMath classes 
he/she wants to take. Classes must be scheduled so that every student can 
have the courses requested without conflict. Model this scheduling problem 
as a graph-coloring problem. What are the vertices, what are the edges, 
what are the colors? 


A banquet center has 8 different special rooms. Each banquet requires 
some subset of these 8 rooms. Suppose that there are 12 evening banquets 
that we wish to schedule in a given week (7 days). Two banquets that are 
scheduled on the same evening must use different special rooms. Model 
and restate this scheduling problem as a graph-coloring problem. 


. (a) A set of solar experiments are to be made at observatories. Each 
experiment begins on a given day of the year and ends on a given day 
(each experiment is repeated for several years). An observatory can 
perform only one experiment at a time. The problem is, What is the 
minimum number of observatories required to perform a given set of 


2.4 Coloring Theorems 85 


experiments annually? Model this scheduling problem as a graph- 
coloring problem. 


(b) Suppose experiment A runs from Sept. 2 to Jan. 3, experiment B from 
Oct. 15 to March 10, experiment C from Nov. 20 to Feb. 17, experiment 
D from Jan. 23 to May 30, experiment E from April 4 to July 28, 
experiment F from April 30 to July 28, and experiment G from June 
24 to Sept. 30. Draw the associated graph and find a minimal coloring 
(show that fewer colors will not suffice). 


15. A convention of mathematicians will have rooms available in six hotels. 
There are n mathematicians and because of personality conflicts, various 
pairs of mathematicians must be put in different hotels. The organizers 
wonder whether six hotels will suffice to separate all conflicts. Model this 
conflict problem with a graph and restate the problem in terms of vertex 
coloring. 


16. Which of the following pairs of tours in Figure 2.15a@ can be combined 
without violating the 3-colorability requirement of the tour graph? 


(a) Tours D and E. (b) Tours C and D. 


17. In a round-robin tournament where each pair of n contestants plays each 
other, a major problem is scheduling the play over a minimal number of 
days (each contestant plays at most one match a day). 


(a) Restate this problem as an edge-coloring problem (see Exercise 8). 
(b) Solve this problem for n = 4 and n = 5, 


18. Consider a graph representing games played between a set of football teams 
with a directed edge from vertex A to vertex B if team A beats team B. 
Suppose that it is known that this graph has no directed circuits (i.e., no 
situation such as A beats B, B beats C, and C beats A). We can define a 
set of levels in this football graph as follows: A vertex with no outward 
edges (the team beat no other team) is at level 0; if all a vertex’s outward 
edges go to level-0 vertices (no-win teams), then the vertex is at level 1; 
and in general, a vertex is at level k if the greatest level of a team it beat 
is level k — 1. Show that the level number of each vertex is a proper 
“‘coloring’’ of the vertices. 


—_—____ 4 


2.4 COLORING THEOREMS 


In the previous section we studied strategies for finding a minimal coloring of 
a graph and gave some applications that could be modeled as graph-coloring 
problems. In this section we present some theorems about graph coloring. 
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Lil SLX 


ie A e 


Figure 2.16 (a) (b) 


We begin with a theorem about coloring the corners of a polygon and use 
it to obtain a simple solution to an interesting problem in computational geome- 
try. We treat a polygon as a plane graph consisting of a single circuit with 
edges drawn as straight lines. The polygon need not be a convex figure. See 
the sample polygon in Figure 2.16a. By a triangulation of a polygon, we mean 
the process of adding a set of straight-line chords between pairs of vertices of 
a polygon so that all interior regions of the graph are bounded by a triangle 
(these chords cannot cross each other nor can they cross the sides of the 
polygon). Figure 2.16b shows one possible triangulation of the polygon in 
Figure 2.16a. 


Theorem I 
The vertices in a triangulation of a polygon can be 3-colored. 


Proof 


Our proof is by induction on 7, the number of edges of the polygon. For n = 
3, give each corner a different color. Assume that any triangulated polygon 
with less than n boundary edges, n = 4, can be 3-colored and consider a 
triangulated polygon T with n boundary edges. 

Pick a chord edge e, as illustrated by chord (g, &) in Figure 2.17a. We note 
that T must have at least one chord edge, or else since n = 4, T would not be 
triangulated. This chord e splits T into two smaller triangulated polygons, as 
shown in Figure 2.17b, each of which can be 3-colored by the induction 
assumption. The 3-colorings of the two subgraphs can be combined to yield a 
3-coloring of the original triangulated polygon by picking the names for the 
colors in the two subgraphs so that the end vertices of chord e have the same 
colors in each subgraph. In Figure 2.17b, this would mean making the color 
of k be the same in the two subgraphs and making the color of g be the same. 


In practice, it is easy to produce a 3-coloring of a triangulation of a polygon. 
Moreover, this 3-coloring is unique. For details, see Exercise 5. We now present 
an interesting application of Theorem 1 to a problem that seems to have nothing 
to do with coloring. 
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(a) (b) 
Figure 2.17 


The Art Gallery Problem asks what is the least number of guards needed 
to watch paintings along the n walls of an art gallery. The walls are assumed 
to form a polygon. The guards need to have a direct line of sight to every point 
on the walls. A guard at a corner is assumed to be able to see the two walls 
that end at that corner. In 1978 Fisk found a simple solution to the Art Galley 
Problem using Theorem 1. The expression [7] denotes the largest integer <= r. 
For further details, see O’Rourke [5]. 


Corollary 
The Art Gallery Problem with n walls requires at most |n/3| guards. 


Proof 


Make a triangulation of the polygon formed by the walls of the art gallery. 
Observe that a guard at any corner of a triangle has all sides of the triangle 
under surveillance. Now obtain a 3-coloring of this triangulation. Note that 
each triangle will have one corner of each color. Take one of the colors, say 
“‘red,’’ and place a guard at every corner colored ‘‘red.’’ This places a guard 
at a corner on every triangle. Hence the sides of all triangles, and, in particular 
all the gallery walls, will be watched. A polygon with n walls has n corners. 
If there are n corners and 3 colors, some color is used at |n/3| or fewer 
comers. @ 


This bound is the best possible. For example, Figure 2.18 gives an example 
of a polygon with 12 corners that requires 4 guards. 

We now present four representative coloring theorems. We will prove only 
the last one, the 5-color theorem for planar graphs. For proofs of Theorems 
2—4 see [4]; Theorem 3 is also proved in the exercises. Note that Theorem 2 
in Section 1.3 was a coloring theorem. As stated there, it said that a connected 


Figure 2.18 
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graph is bipartite if and only if all circuits have even length. Observe that being 
bipartite is the same as being 2-colorable. Thus, a graph is 2-colorable if and 
only if all circuits have even length. (We can drop the condition that the graph 
be connected, because if each component is 2-colorable, then the whole graph 
is 2-colorable.) Let y(G) denote the chromatic number of the graph G. 


Theorem 2. Brook’s Theorem 


If the graph G is not an odd circuit or a complete graph, then x(G) = d, where 
d is the maximum degree of a vertex of G. 


The maximum degree is for most graphs a poor upper bound on x(G). The 
examples in Section 2.3 had x(G) closely related to the size of the largest 
complete subgraph. Thus, it seems natural that there should be a good bound 
on x(G) in terms of the size of the largest complete subgraph. The following 
theorem shows that this approach fails. 


Theorem 3 
For any positive integer k, there exists a triangle-free graph G with x(G) = k. 


Instead of coloring vertices, we can color edges so that edges with a 
common end vertex get different colors. A very good bound on the edge 
chromatic number of a graph in terms of degree is possible. All edges incident 
at a given vertex must have different colors, and so the maximum degree of a 
vertex in a graph is a lower bound on the edge chromatic number. Even better, 
one can prove: 


Theorem 4 Vizing’s Theorem 


If the maximum degree of a vertex in a graph G is d, then the edge chromatic 
number of G is either d or d + 1. 


Finally we would like to present and prove a theorem about coloring planar 
graphs. As noted earlier, in 1976 Appel and Haken proved that all planar graphs 
can be 4-colored. But their proof is incredibly long and requires thousands of 
hours of computer analysis involving 1955 cases, each of which in turn involves 
many pages of analysis. We will state and prove an easier ‘‘second best’’ 
theorem. 


Theorem 5 
Every planar graph can be 5-colored. 


Proof 


We need to consider only connected planar graphs, since we can 5-color uncon- 
nected planar graphs by 5-coloring each connected component. A key step in 


2.4 Coloring Theorems 89 


this proof uses a fact about planar graphs proved in Exercise 16 in Section 1.4: 
Any connected planar graph has a vertex of degree at most 5. We prove this 
theorem by induction on the number of vertices. Trivially a 1-vertex graph can 
be 5-colored. 

Next we assume that all connected planar graphs with n — 1 vertices 
(n 2 2) can be 5-colored. We will prove that a connected planar graph G 
with n vertices can be 5-colored. As noted above, G has a vertex x of degree 
at most 5. Delete x from G to obtain a graph with n — 1 vertices, 
which by assumption can be 5-colored. Now we reconnect x to the rest of 
the graph and try to properly color x. If x has degree = 4, then we can 
simply assign x a color different from the colors of its neighbors. The same 
approach works if the degree of x is 5 but two neighbors have the same color. 
Thus, it remains to consider the case where x has five adjacent vertices 
each with a different color. See Figure 2.19, where we label the neighbors 
of x as a, b, c, d, e according to their clockwise order about x in some 
planar depiction of G. Let the colors be the numbers shown in Figure 
2.19. 

First consider all paths starting from a whose vertices are colored 1 and 
3. See Figure 2.19. Suppose there is no path consisting of 1 and 3 vertices 
from a to c. Then we can change the color of a from 1 to 3, change the 
neighbors of a colored 3 to 1, and so on along all paths of 1 and 3 vertices 
emanating from a. This 1—3 interchange will not affect c since there is no path 
of 1s and 3s from a to c. After this 1-3 interchange from a, a and c are both 
color 3, and x can be properly colored with 1. 

On the other hand, if there is a 1-3 path from a to c, then we consider all 
paths starting from b, whose vertices are colored 2 and 4. The 1-3 path from 
a to c, together with edges (x, a), (x, c), form a circuit that blocks the possibility 
of any 2—4 path going from b to d. Thus, we can perform a 2—4 interchange 
along all paths of 2s and 4s emanating from b without changing the color of 
d. After this 2—4 interchange, b and d are both color 4, and x can be properly 
colored with 2. 

This completes the induction step of the proof that any m-vertex connected 
planar graph can be 5-colored. 


Figure 2.19 
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2.4 EXERCISES 


Mh Summary of Exercises Theseexercises are all proofs of results 


in coloring theory. 


1. 


Use the fact that every planar graph has a vertex of degree < 5 to give a 
simple induction proof that every planar graph can be 6-colored. Follow 
the argument in the beginning of the proof of Theorem 5. 


. Show that a planar graph G with 8 vertices and 13 edges cannot be 2- 


colored. (Hint: Use results in Section 1.4 to show that G must contain a 
triangle.) 


. For any two nonadjacent vertices x, y in a graph G, define graphs Gj and 


G‘, as follows. G;, is obtained by adding the edge (x, y) to G, and Gt, is 
obtained from G by coalescing vertices x and y into a single vertex. Show 
that x(G) = min{x(G.y), x(Gy)}. 


. Prove by induction that the graph of any triangulation of a polygon will 


have at least two vertices of degree 2. (Hint: Split the triangulation graph 
into two triangulation graphs at some chord e.) 


. (a) Using the type of reasoning in Section 2.3, explain how to 3-color any 


triangulation of a polygon. 


(b) Use the argument in part (a) to show that the 3-coloring of any triangula- 
tion of a polygon is unique, except for changing the names of the 
colors. 


. (a) If q is the size of the largest independent set in a graph G, show that 


x(G)q = n. 

(b) If the minimum degree of a vertex is d in an n-vertex graph G, then 
use the result in part (a) to show that x(G)(n — d) = n, and hence 
x(G) = nf(n - a). 


. A graph is color critical if the removal of any vertex decreases the graph’s 


chromatic number. Show that every k-chromatic color critical graph G has 
the following properties: 


(a) G is connected. 
(b) Every vertex of G has degree = k — 1. 
(c) G has no vertex whose removal disconnects G. 


. Show that G can be edge k-colored if and only if L(G), the line graph of 


G (see Exercise 16 in Section 2.1), can be vertex k-colored. 


. If G is the complement of G, then show that: 


(a) x(G) + x(G) <n + 1 (Hint: Use induction.) 


(b) x(G)xG) = n. 
(c) x(G) + xG) = 2Vn 
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10. Show that if every region in a planar graph has an even number of bounding 
edges, then the vertices can be 2-colored. 


11. The chromatic polynomial P,(G) of a graph G is a polynomial in & that 
gives for each positive integer k, the number P,(G) of different k-colorings 
of G. Show that no planar graph G has a chromatic polynomial of the form 
P(G) = (kK — 6k + 8)O(K). 

12. Show that a graph with at most two odd-length circuits can be 3-colored. 


13. Show that a graph is k-colorable if and only if its edges can be directed 
so that its longest path has length k — 1. 


14. Use the fact that every planar graph with fewer than 12 vertices has a 
vertex of degree = 4 (Exercise 17 in Section 1.4) to prove that every planar 
graph with less than 12 vertices can be 4-colored. 


15. Show that if G is an interval graph (see Example 6 in Section 1.1), then 
X(G) equals the size of the largest complete subgraph in G. 


16. Prove Theorem 3 using the following hints. 


(a) The proof should be by induction on k, the chromatic number. Initially 
for k = 3, we use the graph G; consisting of a 5-circuit. 


(b) Assuming one can construct G,, a triangle-free graph with x(G,) = k, 
one constructs G;,, by making k copies of G, and then adding (n,)* 
vertices, where n, is the number of vertices in G,. Each new vertex 
has as its set of neighbors a different k-tuple consisting of one vertex 
from each copy of G;. Confirm that this new graph is the desired G;,,. 


—see 


Wy 2.5 SUMMARY AND REFERENCES 


This chapter presented two important graph-theoretic concepts: covering cycles 
or circuits and coloring. Section 2.1 discussed Euler cycles—cycles that traverse 
every edge exactly once. Section 2.2 discussed Hamilton circuits—circuits that 
visit every vertex exactly once. Both types of covering edge sets arise naturally 
in operations research routing problems. Despite the similarity in the definitions 
of Euler cycles and Hamilton circuits, determining the existence of Euler cycles 
and Hamilton circuits in a graph are as different as graph-theoretic problems 
can be. Euler’s theorem allows one quickly to decide whether an Euler cycle 
exists. On the other hand, except in special cases, the existence or nonexistence 
of a Hamilton circuit can be determined only by a laborious systematic search 
to try all possible ways of constructing a Hamilton circuit. 

Section 2.3 introduced graph coloring with ad hoc coloring schemes and 
some applications of coloring. Section 2.4 gave a sampling of coloring theory, 
highlighted by a proof of the fact that any planar graph can be 5-colored. The 
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stronger theorem proved in 1976 by Appel and Haken [1], that planar graphs 
are 4-colorable, was the motivation of much of the research in graph theory 
over the last 100 years. The search for a proof of the Four Color Theorem led 
to reformulations of this theorem in terms of Hamilton circuits and other graph 
concepts whose properties were then examined. 

As mentioned in Section 1.4, Euler’s 1736 analysis of Euler circuits was 
the first paper on graph theory. Euler’s paper (translated in Biggs, Lloyd, and 
Wilson [3]) makes very interesting reading. It is instructive to see how awkward 
Euler’s writing was when he lacked the modern terminology of graph theory. 
The first use of the concept of a Hamilton circuit occurred in a 1771 paper by 
A. Vandermonde that presented a sequence of moves by which a knight could 
tour all positions of a chessboard (without repeating a position). The name 
‘*Hamilton’’ refers to W. Hamilton, whose algebraic research led him to con- 
sider special types of circuits and paths on the edges of a dodecahedron [see 
Exercise 2(c) in Section 2.2]. Hamilton even had a game marketed that involved 
finding a Hamilton circuit on a dodecahedron (Hamilton’s instructions for this 
game are reprinted in [3]). See Barnette [2] for a good history of the Four 
Color Problem, its restatements, and final solution by Appel and Haken. 


1. K. Appel and W. Haken, ‘‘Every planar map is 4-colorable,’’ Bull. Am. 
Math. Soc. 82 (1976), 711-712. 

2. D. Barnette, Map Coloring and The Four Color Problem, Mathematical 
Association of America, Washington, DC, 1984. 

3. N. Biggs, E. Lloyd, and R. Wilson, Graph Theory 1736-1936, Clarendon 
Press, Oxford, 1976. 

4. J. Bondy and U. Murty, Graph Theory with Applications, American Elsev- 
ier, New York, 1976. 

5. J. O’Rourke, Art Gallery Theorems and Algorithms, Oxford University 
Press, New York, 1987. 


CHAPTER 3 
TREES AND SEARCHING 


3.1 PROPERTIES OF TREES 


The most widely used special type of graph is a tree. There are two ways to 
define trees. In undirected graphs, a tree is a connected graph with no circuits. 
Alternatively, one can define a tree as a graph with a designated vertex called 
a root such that there is a unique path from the root to any other vertex in the 
tree. This second definition applies to directed as well as undirected graphs. 

Intuitively, a tree looks like a tree. See the examples of trees in Figure 3.1. 
The vertex labeled a is a root for each of the trees in Figure 3.1. The equivalence 
of the above two definitions is proved in Exercise 5. To illustrate the equivalence, 
observe that in the tree in Figure 3.1a, the addition of an edge (g; h) would 
create a circuit (when edge directions are ignored) and would simultaneously 
create a second path from root a to h via g. 

Trees are a remarkably powerful tool for organizing information and search 
procedures. In this chapter, we survey some of the diverse settings in which 
trees can be used to represent and analyze search procedures. These settings 
include solving puzzles (Section 3.2), testing properties of graphs such as 
connectedness (Section 3.3), solving the ‘‘traveling salesperson’’ problem (Sec- 
tion 3.4), and sorting lists (Section 3.5). In this first section, we present some 
basic properties of trees and introduce some convenient terminology for working 
with trees. We prove several useful counting formulas about trees and illustrate 
some uses of these formulas. 

Observe that if a tree is an undirected graph (with no directed edges), then 
any vertex can be the root. For example, the tree in Figure 3.1b is drawn so 
that a appears to be a root, but the tree in Figure 3.1c, which is a redrawing 
of Figure 3.1b, has no single vertex that is a natural root—that is, any vertex 
can be the root. 

In most of this chapter we will be using trees with directed edges. Following 
common terminology, we call a directed tree a rooted tree. A rooted tree T 
has a unique root, for if vertices a and b were both roots of T, then there would 
be paths from a to b and from b to a forming a circuit. An undirected tree is 
unrooted in the sense that it has no one particular root. An undirected tree can 
be made into a rooted tree by choosing one vertex as the root and then directing 
all edges away from the root. For example, to root the undirected tree in Figure 
3.1b at vertex a, we would simply direct all the edges from left to right. 
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(b) (c) 
Figure 3.1 


The standard way to draw a rooted tree T is to place the root a at the top 
of the figure. Then the vertices adjacent from a are placed one level below a, 
and so on, as in Figure 3.1a. We say that the root a is at level 0, vertices b 
and c in that tree are at level 1, vertices d, e, f, and g in that tree are at level 
2, and so forth. The level number of a vertex x in T is the length of the (unique) 
path from the root a to x. 

For any vertex x in a rooted tree T, except the root, the parent of x is the 
vertex y with an edge (y,; x) into x (the unique edge directed into x). The 
children of x are vertices z with an edge directed from x to z. Children have 
level numbers one greater than x. Two vertices with the same parent are siblings. 
The parent—child relationship extends to ancestors and descendants of a vertex. 
In Figure 3.la, vertex e has b as its parent, A and i as its children, d and f as 
its siblings, a as its other ancestor, and / as its other descendant. Observe that 
each vertex x in a tree T is the root of the subtree of x and its descendants. 
For easy reference, there is a glossary of tree-related terminology at the end 
of this text. 


Theorem 1 
A tree with n vertices has n — 1 edges. 


Proof 


Assume that the tree is rooted; if undirected, make it rooted as described above. 
We can pair off a vertex x with the unique incoming edge (x, y) from its parent 
y. Since each vertex except the root has such a unique incoming edge, there 
are n — 1 nonroot vertices and hence n — 1 edges. @ 


Vertices of T with no children are called leaves of T. Vertices with children 
are called internal vertices of T. If each internal vertex of a rooted treé has m 
children, we call T an m-ary tree. If m = 2, T is a binary tree. 


Theorem 2 


Let T be an m-ary tree with 1 vertices, of which i vertices are internal. Then, 
n=mict il. 
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Proof 


Each vertex in a tree, other than the root, is the child of a unique vertex (its 
parent). Each of the 7 internal vertices has m children, and so there are a 
total of mi children. Adding the one nonchild vertex, the root, we have n = 
m-+il.¢ 


Corollary 


Let T be an m-ary tree with n vertices, consisting of i internal vertices and / 
leaves. If we know one of n, i, or /, then the other two parameters are given 
by the following formulas: 


(a) Given i, then] = (m — 1)i+ landn = mit 1. 
(b) Given J, then i = (1 — 1)/(m — 1) and n = (ml — 1)/(m — 1). 
(c) Given n, then i = (n — 1)/m and / = [(m — 1)n + 1)/m. 


The proof of the corollary’s formulas follow directly from n = mi + 1 (Theorem 
2) and the fact that / + i = n. Details are left as an exercise. 


Example 1 


If 56 people sign up for a tennis tournament, how many matches will be played 
in the tournament? 

The tournament proceeds in a binary-tree-like fashion. The entrants are 
leaves and the matches are the internal vertices. See Figure 3.2. Given /] = 56 
and m = 2, we determine i from part (b) of the corollary: i = (J — 1)/(m — 
1) = (56 — 1)(2 — 1) = 55 matches. m= 


Example 2 


Suppose a telephone chain is set up among 100 parents to warn of a school 
closing. It is activated by a designated parent who calls a chosen set of three 
parents. Each of these three parents calls given sets of three other parents, and 
so on. How many parents will have to make calls? Repeat the problem for a 
telephone tree of 200 parents. 

Such a telephone chain is a rooted tree with 100 vertices. An edge corres- 
ponds to a call and an internal vertex corresponds to a parent who makes a 
call. Since we know n = 100 and that the tree is ternary (3-ary), part (c) of 


Figure 3.2 Ss 


Winner 
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the corollary can be used to determine i, the number of callers: i = (n — 1)/ 
m = (100 — 1)/3 = 33. 

When we repeat the computation for an organization of 200 people, we 
get i = (200 — 1)/3 = 663 internal vertices. By Theorem 2, a ternary tree must 
have a number of vertices n equal to 3i + 1, for some i—that is, n = 1 (mod 
3). But 200 = 2 (mod 3), and so we do not have a true ternary tree. Either 
199 or 202 parents would give a ternary tree. As a practical matter, with 200 
people there will be 66 parents who each make 3 calls and one parent who 
makes just one call (one internal vertex with one child). = 


The height of a rooted tree is the length of the longest path from the root 
or, equivalently, the largest level number of any vertex. A rooted tree of height 
his called balanced if all leaves are at levels h and h — 1. Balanced trees are 
‘‘good’’ trees. The telephone chain tree in Example 2 should be balanced to 
get the message to everyone as quickly as possible. A tennis tournament’s tree 
should be balanced to be fair; otherwise some players could reach the finals 
by playing several fewer matches than other players. Making an m-ary tree 
balanced will minimize its height. The tree in Figure 3.1b, with a as root, is a 
balanced binary tree of height 3. 


Theorem 3 
Let T be an m-ary tree of height h with / leaves. Then: 


(a) | <= m", and if all leaves are at height h, 1 = m*. 
(b) A = [log,,/], and if the tree is balanced, h = [log,,/ |. 


Proof 


The expression [r] denotes that the smallest integer = r; that is, [7] rounds r 
up to the next integer. 


(a) Obviously an m-ary tree of height 1 has m leaves (children of the root). 
Now we use induction on m to show that an m-ary tree of height / has at 
most m' leaves with / = m’" if all leaves are at level h. An m-ary tree of 
height h can be broken into m subtrees rooted at the m children of the root. 
See Figure 3.3 These m subtrees have height at most h—1. By induction, 
each of these subtrees has at most m""' leaves, and if all leaves are at 
height h—1 in the subtrees, each has exactly m'~'. The m subtrees combined 
have at most m X m7! = m* leaves, and if all leaves are at height h, there 
are exactly m" leaves. 


Taking the logarithm base m on both sides of the inequality / = m" yields 
log,,/ < h. Since h is an integer, we have [log,,/ | = h. If the tree is balanced 
with height /, then the largest possible value for / is / = m°* (if all leaves 
are at level 4), and the smallest possible value is / = m’"'+ 1 (with one 
leaf at level h and the rest at level h—1). So m*!< 1 = m". Taking 
logarithms on both sides yields h — 1 < log,/ < A, or h = [log,/]. 


(b 


— 


3.1 Properties of Trees 97 


Figure 3.3 


The most common use of trees is in searching. The following two sequential 
testing examples, one a basic computer science problem and the other a logical 
puzzle, illustrate the use of trees in searching. 


Example 3 


Let us reexamine the compiler ‘‘dictionary look-up’’ problem discussed in 
Example 3 of Section 1.1. We want to identify an unknown word (number) X 
by testing it in a three-way branch (less than, equal to, greater than) against 
“‘words’’ in a set (dictionary) to which X belongs. The test procedure can be 
represented by a binary, or almost binary, tree. If X were one of the first 14 
letters of the alphabet, then Figure 3.4 is such a binary search tree. Each vertex 
is labeled with the letter tested at that stage in the procedure. The procedure 
starts by testing X against H. The left edge from a vertex is taken when X is 
less than the letter and the right edge when X is greater. Such a tree may have 
one internal vertex with just one child if the number of vertices is even (as is 
the case for vertex N in Figure 3.4). This tree is built by making the middle 
letter in the list (in this case, G or H) the root. The left child of the root is the 
middle letter in the left subtree (in this case, D), and so on. 

To minimize the number of tests needed to recognize any X, that is, the 
height of the search tree, we should make the tree balanced. Suppose that X 
were to belong to a set of n ‘‘words.’’ What is the maximum number of tests 
that would be needed to recognize X? By the corollary to Theorem 2, a binary 
search tree with n vertices has 
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leaves. Then the maximum number of tests needed to recognize X is the height 
of a balanced (n + 1)-leaf search tree. By Theorem 3, h = [log.[(n + 1)/2]] 
= [log(n + 1)]— 1. = 


Example 4 


A well-known logical puzzle has n coins, one of which is counterfeit, too light 
or too heavy, and a balance to compare the weight of any two sets of coins 
(the balance can tip to the right, to the left, or be even). For a given value of 
n, we seek a procedure for finding the counterfeit coin in a minimum number 
of weighings. Sometimes one is told whether the counterfeit coin is too light 
or too heavy. If we are told that the fake coin is too light, how many weighings 
are needed for n coins? 

Our testing procedure will form a tree in which the first test is the root, 
the other tests are the other internal vertices, and the solutions, that is, which 
coin is counterfeit, are the leaves. See the testing procedure in Figure 3.5 for 
eight coins. The coins are numbered 1 through 8, and the left edge is followed 
when the left set of coins in a test is lighter, the middle edge when both sets 
have the same weight, and the right edge when the right set of coins is lighter. 
Note that when weighing 4 and 5 (and already knowing that 1, 2, 3, 6, 7, 8 
are not the light coin), the balance cannot be even. 

The test tree is ternary, and with n coins there will be n leaves—that is, 
n different possibilities of which coin is counterfeit. Theorem 3 tells us that 
the test tree must have height at least [log;7 | to contain n leaves. For the light 
counterfeit coin problem, this bound can be achieved by successively dividing 
the current subset known to have the fake coin into three almost equal piles 
and comparing two of the piles of equal size, as in Figure 3.5. 

If the counterfeit coin could be either too light or too heavy, then the 
problem is harder (see Exercise 25). A particular coin will appear at 2 leaves 
in the test tree, once when the coin is determined to be too light and another 
time when too heavy. m= 


We conclude this section with a formula for the number of different undi- 
rected trees on 7 distinct items. Let the items be the numbers | through n. For 
example, there are three different labeled trees on three items. Each 3-item tree 
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is a path of two edges with the difference being which of the three items is 
the one non-leaf vertex: 1-2-3, 1-3-2, and 2-1-3 (switching the position of 
the two leaves does not produce a different tree). The formula was first proved 
by Cayley in 1889. We present a simpler proof due to Prufer. 


Theorem 4 
There are n”~? different undirected trees on n items. 


Proof 


Observe that n"~* is the number of sequences of the 7 items of length n — 2. 
We now construct a one-to-one correspondence between trees on n items and 
(n ~ 2)-length sequences of the n items. Recall that for simplicity, we let the 
items be the numbers 1, 2,..., 7. 

For any tree on n numbers, we form a sequence (5), 52, ... , S,-2) of length 
n — 2 as follows. Let J; be the leaf in the tree with the smallest number and 
let s, be the number of the one vertex adjacent to it. For the tree in Figure 3.6, 
the leaf with the smallest number is 1 and the number of its neighboring vertex 
is 6. So s,; = 6. We delete vertex /, from the graph and repeat this process. 
For the tree in Figure 3.6, /,, the smallest numbered leaf in the tree after 1 is 
deleted, is 4 and its neighbor is 2. So s, = 2. Continuing we have /; = 5 and 
8; = 2, then J, = 2 and s, = 3, then /; = 6 and s; = 3, and then |; = 7 and 
5s = 3. We stop when the remaining tree has been reduced to two leaves joined 
by an edge. The 6-item sequence for the 8-item tree in Figure 3.6 is thus (6, 
2, 2, 3, 3, 3). Such sequences are called Prufer sequences. 

Next we show that any such (7 — 2)-length sequence of n items defines 
a unique n-item tree. We simply reverse the procedure in the preceding para- 
graph used to build the sequence. Observe that leaves (vertices of degree 1) 
will never appear in the sequence. The first number of the sequence is the 
neighbor of the smallest numbered leaf. From what we just observed, this 
smallest numbered leaf is the smallest number that does not appear in the 
sequence. For the sequence (6, 2, 2, 3, 3, 3), 1 is the smallest number not in 
the sequence and so 1 is the leaf with 6 as its neighbor. 

Now we set the smallest leaf (item 1) aside (its position in the tree—a leaf 
adjacent to the first item in the sequence—is determined) and we consider the 
first item (item 6) as a leaf that will be adjacent to some item in the remaining 
sequence. We then repeat the process of identifying the smallest leaf in the 
remaining (” — 1)-item tree specified by the remaining (n — 3)-item sequence. 
For the remaining sequence (2, 2, 3, 3, 3), item 4 is the smallest of the remaining 
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numbers (item 1 has been deleted) not in the sequence and so item 4 is a leaf and 
it is adjacent to item 2, the first number of the remaining sequence. Continuing in 
the reduced sequence (2, 3, 3, 3), item 5 is the smallest leaf of the remaining 
numbers and it is adjacent to item 2. Now item 2 becomes a potential “‘leaf”’ 
with respect to the remaining sequence (3, 3, 3). Note that the available leaves 
are currently items 2, 6, 7, 8. So item 2, the smallest available leaf, is adjacent 
to item 3. In the sequence (3,3), item 6 is adjacent to item 3. Finally item 7 is 
adjacent to item 3. There remain items 3 and 8 and they must be adjacent to 
each other. 

Thus the correspondence between n-item trees and sequences of length 
n — 2 is one-to-one and the theorem is proved. 


3.1 EXERCISES 


Mb Summary of Exercises Exercises 3—15 present theory about 


trees. Exercises 16-25 involve various modeling problems with trees. 
1, Draw all nonisomorphic trees with: 
(a) Four vertices (b) Five vertices (c) Six vertices 


2. Suppose a connected graph has 20 edges. What is the maximum possible 
number of vertices? 


3. Show that all trees are 2-colorable. 


pS 


. Show that all trees are planar. 

5. Show that an undirected connected graph G is a tree (i.e., has a unique 
path from root to each vertex) if any one of the following conditions hold: 
(a) G has no circuits. 

(b) G has fewer edges than vertices. 
(c) Removal of any edge disconnects G. 

6. Reprove Theorem 1 by using the fact that trees are planar (Exercise 4) and 
Euler’s formula (Theorem 2 in Section 1.4). 

7. Show that any tree with more than 1 vertex has at least 2 vertices of de- 
gree 1, 

8. Prove the following parts of the Corollary to Theorem 2: (a) Part a. 
(b) Part b. (c) Part c. 

9. Reprove that / < m’ in an m-ary tree of height h by counting the maximum 


possible number of choices at each internal vertex when building a path 
from the root to a leaf. 


10. What is the maximum number of vertices (internal and leaves) in an m- 
ary tree of height h? 


11. Show that the fraction of internal vertices in an m-ary tree is about 1/m. 
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. A forest is an unconnected graph that is a disjoint union of trees. If G is 


an n-vertex forest of ¢ trees, how many edges does it have? 


. Show that the sum of the level numbers of all / leaves in a binary tree is 


at least /[log,/], and hence the average leaf level is at least [log,/ ]. 


. Let T be an undirected tree. If the choice of vertex x to be the root yields 


a rooted tree of minimal height, then x is call a center of T. Show that any 
undirected tree has at most two centers. 


. Show that the chromatic polynomial of an n-vertex tree is k(kK—1)"' (see 


Exercise 7 in Section 2.3). 


. Any m-ary tree, m = 3, can be ‘‘converted’’ into a binary tree by the 


following substitution. 


1 ¥2 3 Ym ¥3 


(a) Perform this conversion for the tree in Figure 3.5. 


(b) If an m-ary tree has height h, what is the maximum possible height 
after conversion? 


- Consider the problem of summing n numbers by adding together various 


pairs of numbers and/or partial sums, for example, {[(3 + 1) + (2 + 5)] 

+ 9}. 

(a) Represent this addition process with a tree. What will internal vertices 
represent? 

(b) What is the smallest possible height of an ‘‘addition tree’’ for summing 
100 numbers? 


. What type of search procedure is represented by the search tree below? 


. A tree can be used to represent a binary code; a left branch is a 0 and a 


right branch a 1. The path to a letter (vertex) is its binary code. To avoid 
confusion, one sometimes requires that the initial digits of one letter’s code 
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20. 


21. 


22. 


23. 


24. 


25. 


cannot be the code of another letter (e.g., if K is encoded as 0101, then no 
letter can be encoded as 0 or 01 or 010). Under this requirement, which 
vertices in a tree represent letters? How many letters can be encoded using 
n-digit binary sequences? 

Suppose that each player in a tennis tournament (like the binary-tree tourna- 
ment in Example 1) brings a new can of tennis balls. One can is used in 
each match and the other can is taken by the match’s winner along to the 
next round. Use this fact to show that a tennis tournament with n entrants 
has n — 1 matches. 


Consider a tennis tournament T (with the tree structure illustrated in Figure 
3.2) with 32 entrants. 


(a) How many players are eliminated (lose) in the first two rounds of 
matches? 


(b) Suppose that the losers in the first two rounds of the tournament qualify 
for a losers tournament 7’. How many players are eliminated in the 
first two rounds of matches in 7’? Note that all tennis tournaments 
are balanced trees; the number of people playing matches in each round 
after the first round is a power of 2. 


(c) Suppose that the people who lose in the first two rounds of T’ qualify 
for another losers tournament 7”. How many players are eliminated in 
the first two rounds of matches in T”? 


(d) Suppose that the people who lose in the first two rounds of T” qualify 
for another losers tournament and so on until finally there is just one 
grand loser (the last tournament has two people). How many losers 
tournaments are required to determine this grand loser? 


Suppose that a chain letter is started by someone in the first week of the 
year. Each recipient of the chain letter mails copies on to five other people 
in the next week. After six weeks, how much money in postage (30¢ a 
letter) has been spent on these chain letters? 

(a) Repeat Example 3 assuming now that only a two-way branch (less 
than, greater than or equal to) is available. Draw a balanced search 
tree for the first 13 letters and determine the height of an n-letter search. 

(b) Suppose a two-way branching search tree for letters A, B, C, D, E is 
to take advantage of the following letter frequencies: A 20%, B 20%, 
C 30%, D 10%, E 20%. Build a two-way tree that minimizes the 
average number of tests required to identify a letter. 


(a) Repeat Example 4 for 20 coins with at most one too light. 
(b) Prove by induction that 3” or fewer coins with one too light can be 
tested in at most n weighings to find out which one is too light. 


Suppose we have four coins and possibly one coin is either too light or 
too heavy (all four might be true). 
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(a) Show how to determine which of the nine possible situations holds 
with just two weighings, if given one additional coin known to be true. 


(b) Show that two weighings are not sufficient without the extra true coin. 


26. In the proof of Theorem 4, we showed that a Prufer sequence (8), ©, ..., 
S,-2) uniquely described a tree on n items. Construct the trees with the 
following Prufer sequences. 


(a) (4, 5, 6, 2) (b) (2, 8, 8, 3, 5, 4) (c) (3, 3, 3, 3, 3, 3) 


3.2 DEPTH-FIRST AND BREADTH-FIRST 
SEARCH 


Trees provide a natural framework for finding solutions to problems that involve 
a sequence of choices, whether finding one’s way out of a maze or searching 
all possible solutions for the cheapest solution to a vehicle routing problem. 
Most of the problems in the two preceding chapters—isomorphism, Hamilton 
circuits, minimal colorings, and placing police on street corners—require tree- 
based searching for computerized solutions. Whether we want to find one 
solution, all solutions, or an optimal solution, the first and foremost challenge 
is to be sure that we check all possible ways to generate a solution, that is, the 
enumeration must be complete. By letting the sequential choices be internal 
vertices in a rooted tree and the solutions and ‘‘dead ends’’ be the leaves, we 
can organize our enumeration of possible solutions. Trees also provide an 
excellent framework for implementing shortcuts. 

In this section we present examples of tree enumeration that involve games 
rather than operations research applications. Most operations research tree enu- 
meration problems involve very large trees and use special tree ‘‘pruning”’ 
algorithms. As an example of such applications, we solve a small Traveling 
Salesperson problem in Section 3.4. The next chapter, ‘‘Network Algorithms,”’ 
discusses two important optimization algorithms that implicitly use trees to 
search through graphs. 

There are two basic approaches to tree enumeration. The method, called 
depth-first search or backtracking, builds a path from the root as far as 
possible in the tree. Each new edge must lead to a new vertex (not already in 
the tree). The path building stops when we reach a vertex from which there is 
no edge leading to a new vertex. Such a vertex is a leaf of the depth-first tree. 
If the leaf is not a solution or if we must continue to find all other solutions, 
we backtrack from the leaf to the parent of the leaf (the previous choice) and 
then try another edge to build a new path as far as possible. In backtracking, 
if all edges from the previous vertex have been tried, then we backtrack up 
one level higher, and so forth. Eventually this method will generate paths to 
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all leaves—that is, enumerate the whole tree of possible sequences of choices. 
If we need only one solution, a depth-first search will terminate as soon as it 
finds a vertex that is a solution. 

Remember that we can never use an edge going to a vertex already in the 
search tree. In a maze, this can lead to going around and around some circuit 
forever. The consequences in real-world problems are just as bad. 


Example 1 


Consider the maze in Figure 3.7. We start at the location marked with an S 
and seek to reach the end marked with an E. 

We use a depth-first search. For mazes, there is a convenient rule of thumb 
(whose verification is left as an exercise) for constructing a depth-first search: 
stick to the right wall in the maze. When we come to a dead-end, we follow 
the right wall to the end wall, along the end wall, and then backtrack along 
the left wall (now the right wall as we leave the dead end). When we come to 
a previously visited corner, we put an artificial (wiggly) dead-end wall to stop 
us from actually reaching that corner (as at S in Figure 3.7). In the maze in 
Figure 3.7, we use solid lines to indicate the (forward) pathbuilding and dashed 
lines for backtracking. Because the maze is easily searched directly, we have 
not drawn the search tree for this problem (in which S would be the root, other 
corners internal vertices, and dead ends and E the leaves). m= 


The other common method of tree enumeration, called breadth-first 
search, is to determine all edges leaving the root, that is, all possible children 
of the root; then determine all edges leaving these children; and so on. This 
procedure fans out uniformly from the root. Again, no vertex can be repeated. 
Because a breadth-first tree incorporates vertices into the tree as soon as possible, 
a breadth-first tree contains shortest paths from the root to each other vertex. 

If the tree of possible paths is large, then the breadth-first method quickly 
becomes unwieldy. The depth-first method that only traces one path at a time 


Figure 3.7 


3.2. Depth-First and Breadth-First Search 105 


is much easier to use by hand or to program. Further, in cases where we need 
to find only one of the possible solutions, it pays to go searching all the way 
down a path for a solution rather than to take a long time building a large 
number of partial paths, only one of which in the end will actually be used. 
On the other hand, when we want a solution involving a shortest path or when 
there may be very long dead-end paths (while solution paths tend to be relatively 
short), then the breadth-first method is better. All the network optimization 
algorithms in the next chapter use breadth-first searches. 


Example 2 


Suppose we are given three pitchers of water, of sizes 10 quarts, 7 quarts, and 
4 quarts. Initially the 10-quart pitcher is full and the other two empty. We can 
pour water from one pitcher into another, pouring until the receiving pitcher 
is full or the pouring pitcher is empty. Is there a way to pour among pitchers 
to obtain exactly 2 quarts in the 7- or 4-quart pitcher? If so, find a minimal 
sequence of pourings to get 2 quarts in the 7-quart or 4-quart pitcher. 

The positions, or vertices, in this enumeration problem are ordered triples 
(a, b, c), the amounts in the 10-, 7-, and 4-quart pitchers, respectively. Actually 
it suffices to record only (b, c), the 7- and 4-quart pitcher amounts, since a = 
10 — b — c. A directed edge corresponds to pouring water from one pitcher 
to another. Let us draw the tree on a b, c-coordinate grid as shown in Figure 
3.8a. The grid is bounded by b = 7, c = 4,b + c = 10. Pouring between the 
10- and 7-quart pitchers will be a horizontal edge, between 10- and 4-quart 
pitchers a vertical edge, and between 7- and 4-quart pitchers a diagonal edge 
with slope —1. The beginning of the same tree is shown in the standard form 
in Figure 3.8b. 

The root of this search tree is (0, 0). Since we want a minimal sequence 
of pourings (a shortest path in the search tree), we will use a breadth-first 
search. From the root, we can get to positions (7, 0) and (0, 4). From (7, 0), 
we can get to new positions (7, 3) and (3, 4), and from (0, 4), we can get to 
new positions (6, 4) and (4, 0). The tree built thus far is shown in Figure 3.8). 


(0, 4) (2, 4) (3, 4) (4, 4) (6, 4) 
(7, 3) (10, 0, 0) 
(0, 3) 
(6, 0, 4) (3, 7, 0) 
(7, 1) 
7, 0) (6, 4, 0) (0, 6, 4) (0, 7, 3) (3, 3, 4) 


273 44 5 
(3, 0) (4, 0) (6, 0) 
(a) (b) 

Figure 3.8 
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From (7, 3), the only new position is (0, 3), and from (3, 4), the only new 
position is (3, 0). From (6, 4), the only new position is (6, 0), and from (4, 0), 
the only new position is (4, 4). We have now checked all paths of length 3. 
The only new moves now are from (4, 4) to (7, 1) and from (6, 0) to (2, 4). 
But (2, 4) has 2 quarts in the 7-quart pitcher. So (0, 0) to (0, 4) to (6, 4) to 
(6, 0) to (2, 4) is a sequence of pourings to obtain 2 quarts. = 


Suppose we have built a tree to provide the framework for searching or 
organizing information. Searching for a particular vertex or processing informa- 
tion in the tree normally involves a depth-first type of traversal of the spanning 
tree. However, there are several times during a traversal when internal vertices 
can be checked. A preorder traversal of a tree is a depth-first search that 
examines an internal vertex when the vertex is first encountered in the search. 
A postorder traversal examines an internal vertex when last encountered 
(before the search backtracks away from the vertex and its subtree). If a tree 
is binary, we can define an inorder traversal that checks an internal vertex in 
between the traversal of its left and right subtrees. Figures 3.9a and 3.9b display 
numberings of the vertices of the tree in Figure 3.9a according to preorder and 
postorder traversals, respectively. (We assume the left child is visited before 
the right child at each internal vertex.) 

An important property of preorder and postorder traversals is that in a 
preorder traversal a vertex precedes its children and all its other descendants, 
whereas in a postorder traversal a vertex follows its children and all its other 
descendants. 

We now give examples of each type of traversal. In the binary search tree 
example of a compiler dictionary look-up (Example 3 in Section 3.1 and 
Example 3 in Section 1.1), the alphabetical order of the vertices corresponds 
to inorder traversal. In searching through a graph for a vertex with a specified 
label, we should test each new vertex as it is encountered on a depth-first search 
to see if it has the desired label. There is no advantage to postponing such 
testing. Thus, in searching for a special vertex, vertices should be examined 
according to a preorder traversal. 

We demonstrate the need for postorder traversal with an arithmetic tree. 
The arithmetic expression 


(a+b) +c) xX (q+ @+e))—f) + 8) 


Figure 3.9 
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Figure 3.10 


can be decomposed into a binary tree as shown in Figure 3.10. The internal 
vertices are arithmetic operations and the leaves variables. To evaluate this 
arithmetic expression, we need to execute the operations specified by internal 
vertices according to a postorder traversal of the arithmetic tree, that is, we 
cannot perform an operation until the subexpressions represented by the opera- 
tion vertex’s two subtrees are evaluated. 

We conclude this section with a more complicated depth-first search 
problem. 


Example 3 


Find all ways to place eight nontaking queens on an 8 X 8 chessboard. Recall 
that one queen can capture another queen if they are both in the same row or 
the same column or on a common diagonal. 

We develop an algorithm using depth-first search enumeration to print all 
solutions for placing eight nontaking queens. We will try all ways to place 
nontaking queens successively in column 1, column 2, and so forth, through 
column 8. Let a, be the row of the queen in column k. For each i, 1 Si < k, 
we require a, # a; (two queens cannot be in the same row) and |a, — a,| # 
k — i (two queens cannot lie on a common diagonal). When these conditions 
hold for a,, we say that ‘‘a, is compatible with a, a:, ..., a-)."” When a 
compatible a, is found, we descend in the tree to the next level (next column). 
When no compatible a, is found, we backtrack and try the next larger value 
(row) for a compatible a,_,. The algorithm can then be written as follows: 


DESCEND: k—k+13a,<— 1; 
ADDQUEEN: wuitet a = 8 AND a is not compatible with a, .. . , a 
Do a<-—a& +1; 
IF @ = 9 THEN 
ip k > 1 THEN GOTO BACKTRACK eEtse Goto END; 


IF K<8 THEN GoTO DESCEND ELsE PRINT Solution a, &, ... , &s; 


BACKTRACK: k —k — 1; a, — a& + 1; Goto ADDQUEEN; 
END: stop; 


If a, = 9, then there was no new a, compatible with a,, ..., a,-; found in the 
preceding WHILE loop and we must backtrack (or if k = 1, the search is 
finished). m= 
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3.2 EXERCISES 


Mb Summary of Exercises The first seven exercises are based on 


the three examples in this section. Exercises 8—9 involve traversals. Exercises 


10-— 


13 seek search algorithms for solving various games; use the depth-first 


method for searching positions in these games. 


1. 


10. 


(a) Use a depth-first search to find your way through this maze from S 
to E. 


(b) How many ways through this maze (without cycles or backing up) are 
there? 


ae) as 


E 


. Perform a breadth-first search of the graph in Figure 3.7 to find the shortest 


path from S to E. 
Repeat Example 2 using a depth-first search. 


. (a) Repeat Example 2 with pitchers of size 8, 5, and 3 with an objective 


of 4 quarts in one pitcher. 


(b) Repeat Example 2 with pitchers of sizes 12, 8, and 5, and an objective 
of 7 quarts in one pitcher. 


. Find another (actually shorter) way in Example 2 to get 2 quarts in one 


pitcher. 


. Use a depth-first search in Example 2 to show that any amount between 


O and 10 quarts can be obtained in one of the pitchers. 


. Show that the stick-to-the-right-hand-wall rule will always get one out of 


a maze. 


. List the vertices in order of a preorder traversal and a postorder traversal 


of: 
(a) Figure 3.1a (b) Figure 3.1b (c) Figure 3.4 


. Generalize the arithmetic tree in Figure 3.10 to include unary operations 


such as inverses or sin( ). Give the tree for the following expression: sin(((a 
+ ((b X c)! + (a + d) X e))) —(€ +e) + a — BY) 

Suppose there are a dog, a goat, and a bag of tin cans to be transported 
across a river in a ferry that can carry only one of these three items at once 
(along with a ferry driver). If the dog and goat cannot be left alone on a 
shore (when the driver is not present) nor can the goat and tin cans be left 
alone, then find a scheme for getting all across the river. 
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11. Three jealous wives and their husbands come to a river. The party must 
cross the river (from near shore to far shore) in a boat that can hold at 
most two people. Find a sequence of boat trips that will get the six people 
across the river without ever letting any husband be alone (without his 
wife) in the presence of another wife. 


12. (a) Repeat Exercise 11 this time with four jealous wives and their husbands, 
if possible. 


(b) Do part (a) with a three-person boat. 


13. (a) Suppose three missionaries and three cannibals must cross a river in 
a two-person boat. Show how this can be done so that at no time do 
the cannibals outnumber the missionaries on either side (unless there 
are no missionaries on a shore). 


(b) Repeat part (a) with the additional condition that only one of the 
cannibals can row (all the missionaries can row also). 


14, (a) Write a program for finding all ways to place eight nontaking rooks 
on a chessboard. 


(b) Write a program for finding all ways to place eight nontaking bishops 
on a chessboard. 


(c) What is the largest number of nontaking bishops that can be placed 
on a chessboard? 


15. (a) Write a program for finding all ways to place four nontaking kings on 
a4 X 4 chessboard. 


(b) Write a program for finding all ways to place four nontaking knights 
on a 4 X 4 chessboard. 


16. Write a program for finding the minimal number of queens such that all 
squares on the 8 X 8 chessboard are under attack by one of the queens. 


17. Suppose that during a preorder traversal of a binary tree 7, we write down 
a 1 for each internal vertex and a 0 for each leaf in the traversal, building 
a sequence of Is and Os. If T has n leaves, the sequence will have n Os 
and n—1 1s. We call this sequence the characteristic sequence of T. (Such 
a sequence determines a unique tree.) 


(a) Find the binary tree with the characteristic sequence 110100110100100. 

(b) Prove that the last two digits in any characteristic sequence are Os 
(assuming n = 2). 

(c) Prove that a binary sequence with n Os and n—1 Is, for some n, is a 
characteristic sequence of some binary tree if and only if the first k 


digits of the sequence contain at least as many 1s as Os, for 1 <k =< 
2n—2. 


——________y 


110 Chapter 3 Trees and Searching 


3.3 SPANNING TREES 


In many applications, graph algorithms are needed to test whether a graph has 
acertain property, such as connectedness or planarity, or to count all occurrences 
of a given structure, such as circuits or complete subgraphs. The algorithms 
usually employ a spanning tree in searching among vertices and edges of a 
graph to check for these properties or structures. A spanning tree of a graph 
G is a subgraph of G that is a tree containing all vertices of G. Spanning trees 
can be constructed either by depth-first (backtrack) search or by breadth-first 
search. 

To build a depth-first spanning tree, we pick some vertex as the root and 
begin building a path from the root composed of edges of the graph. The path 
continues until it cannot go any further without repeating a vertex already on 
the path. The vertex where this path must stop is a leaf. We now backtrack to 
the parent of this leaf and try to build a path from the parent in another direction, 
and so on, as in the depth-first search method described in Section 3.2. 

To build a breadth-first spanning tree, we pick some vertex x as the root 
and put all edges leaving x (along with the vertices at the ends of these edges) 
in the tree. Then we successively add to the tree the edges leaving the vertices 
adjacent from x, unless such an edge goes to a vertex already in the tree. We 
continue this process as in the breadth-first search method described in Sec- 
tion 3.2. 

It is important to note that if the graph is not connected, then no spanning 
tree exists. We thus have the following result. 


Algorithm to Test Whether an Undirected Graph 
Is Connected 


Use a depth-first or breadth-first search to try to construct a spanning tree. If 
all vertices of the graph are reached in the search, a spanning tree is obtained 
and the graph is connected. If the search does not reach all vertices, the graph 
is not connected. . 


A formal proof that depth-first (breadth-first) searching does search all 
vertices in a connected graph is left as an exercise. 

An adjacency matrix of an (undirected) graph is a (0,1)-matrix with a 1 
in entry (i, j) if vertex x; and vertex x; are adjacent; entry (i, j) is 0 otherwise. 


Example 1 


Is the undirected graph G whose adjacency matrix is given in Figure 3.11a 
connected? 

Let us perform a depth-first search of G starting with x, as the root. At 
each successive vertex, we pick the next edge on the tree to be the edge going 
to the lowest numbered vertex not already in the tree. So from x, we go to x. 


3.3 Spanning Trees 111 


Xi X. xX, Xe Xs Xe HX Xe 
m/O 1 1 1 0 1 0 0 
%/1 0 1 0 10 i1 0 
*%;}/1 100 0 0 0 0 
x%/1 0 00 0 0 1 0 
x/0 10 0 0 0 1 0 
x%/1 00 00 0 0 0 
«4/0 1 01 1 0 0 1 
*%!0 00 0 0 0 1 0 
(a) 
~1 x7 
x2 x2 
*6 x3 x4 x6 
x3 x5 Ps Pa x7 
3 
x4 xg 
Figure 3.11 (b) (c) 


From x, we go to x3. Since x; is not adjacent to any other vertex besides x, and 
X, (which are already in the tree), we backtrack from x; to x, and continue the 
search from x,, going to xs, then to x;, and then to x4. At x, we backtrack to Xy 
and go to xg. From x, we must backtrack all the way back to x). From x, we 
80 to xs. This finishes the search—all vertices have been visited. The spanning 
tree obtained is shown in Figure 3.11b. 

The result of a breadth-first search is shown in Figure 3.11c. = 


The computation time required to make a depth-first search of a graph is 
Proportional to the number of edges in the graph (each edge in the spanning 
tree is traversed twice, and edges that cannot be used are tried just twice, once 
from each end vertex). If an undirected graph is not connected, then we can 
find its components (connected pieces) by applying a depth-first (or breadth- 
first) search at any vertex to find one component; then apply this search starting 
at a vertex not on the previous tree to find another component; and continue 
finding additional components until no unused vertices are left. 

In many important spanning tree applications, such as linking together a 
collection of computers into a connected network, the problem is not, Is there 
a spanning tree? but rather, What is the shortest (cheapest) spanning tree? That 
is, given an undirected graph with lengths associated with each edge, we want 
to find a minimal spanning tree, a spanning tree whose sum of the edge lengths 
is as small as possible. There are two straightforward ‘“greedy’’ algorithms for 
finding a minimal spanning tree. Let n be the number of vertices in the graph. 
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Kruskal’s Algorithm 


Repeat the following step until the set T nhag n— 1 edges (initially T is empty): 
Add to T the shortest edge that does not form a circuit with edges already 
in T. 


Prim’s Algorithm 

Repeat the following step until tree T has n — 1 edges: Add to T the shortest 
edge between a vertex in T and a vertex not in T (initially pick any edge of 
shortest length). 


In both algorithms, when there is a tie for the shortest edge to be added, 
any of the tied edges may be chosen. A proof of the validity of Kruskal’s 
algorithm is given in Exercise 24. 


Example 2 


We seek a minimal spanning tree for the network in Figure 3.12. Both algorithms 
start with a shortest edge. There are three edges of length 1: (a, f), (/, q), and 
(r, w). Suppose we pick (a, f). If we follow Prim’s algorithm, the next edge 
we would add is (a, b) of length 2, then (f, g) of length 4, (g, /), then (/, q), 
then (/, m), and so forth. The next-to-last addition would be either (m, n) or 
(o, t) both of length 5 [suppose we choose (m, n)], and either one would be 
followed by (n, 0). The final tree is indicated with darkened lines in Figure 
3.12. 

On the other hand, if we follow Kruskal’s algorithm, we first include all 
three edges of length 1: (a, f), (J, q), (r, w). Next we would add all the edges 
of length 2: (a, b), (e, j), (g, 1), (A, i), (Lm), (p, 4), (8, x), % y). Next we 
would add almost all the edges of length 3: (c, A), (d, e), (k, 1), (k, p), (q, ), 
(r, 5), (V, w), but not (w, x) unless (7, 5s) were omitted [if both were present 
we would get a circuit containing these two edges together with edges (r, w) 
and (s, x)]. Next we would add all the edges of length 4 and finally either (m, 


4 
d 
3 


Figure 3.12 ¢ 
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Figure 3.13 


n), or (0, t) to obtain the same minimal spanning tree(s) produced by Prim’s 
algorithm. This similarity is no coincidence (see Exercise 27). m 


The difficult part in the minimal spanning tree problem is proving the 
minimality of the two algorithms. We give the proof for Prim’s algorithm and 
leave Kruskal’s as an exercise (Exercise 24). 


Theorem 
Prim’s algorithm yields a minimal spanning tree. 


Proof 


For simplicity, assume that the edges all have different lengths. Let T’ be a 
minimal spanning tree chosen to have as many edges as possible in common 
with the tree T* constructed by Prim’s algorithm. 

If T* # T’, let e, = (a, b) be the first edge chosen by Prim’s algorithm 
that is not in 7’. This means that the subtree 7,-,, composed of the first 
k — 1 edges chosen by Prim’s algorithm, is part of the minimal tree 7’. In 
Figure 3.13, edges of T,-, are in bold, e,’s edge is dashed, and the other edges 
of T’ are drawn normally. Since e; is not in the minimal spanning tree T’, there 
is a path, call it P, in T’ that connects a to b. At least one of the edges of P 
must not be in 7;,-,, for otherwise P U e, would form a circuit in the tree T, 
formed by the first k edges in Prim’s algorithm. Let e* be the first edge along 
P (starting from a) that is not in 7;-; (see Figure 3.13). Note that one end 
vertex of e* is in T;_,. 

If e* is shorter than e,, then on the kth iteration, Prim’s algorithm would 
have incorporated e* not e,. If e* is greater than e,, we remove e* from T’ 
and replace it with e,. The new T’ is still a spanning tree (see Exercise 23 for 
details), but its length is shorter—this contradicts the minimality of the origi- 
nal T’. @ 


3.3. EXERCISES 


Z Summary of Exercises The first six exercises involve building 


Spanning trees. The next five exercises involve minimal spanning trees. Exer- 
cises 12-18 are theory questions associated with spanning trees. Exercises 
19-28 are variations and theory questions about the two minimal spanning tree 
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algorithms. Exercises 29-32 ask for computer programs. The last exercise is 
a tricky theory question. 
1. Find depth-first spanning trees for each of these graphs; 
(a) K, (a complete graph on 8 vertices). 
(b) The graph in Figure 2.5 in Section 2.2. 
(c) The graph in Figure 2.10 in Section 2.3. 


Find breadth-first spanning trees of each of the graphs in Exercise 1. 
3. Find all spanning trees (up to isomorphism) in the following graphs: 
(a) Figure 2.4 (b) K, (c) y 7 


d e 
4. Test the graph whose adjacency matrix is given below to see if it is 
connected. 


X, X, Xz, Xe Xs Xe X7 Xe 


x)/O0 0 10041 0 1 
»}/O 0 10 1 0 1 =0 
%l1lo1041 00 0 0 
uj/O O01 0 1 1 1 ~0 
x}/O 101 0 00 1 
x|1 00 1 0 0 0 0 
x|O 101 0 0 0 1 
x!1 0001 0 1 0 
5. Consider an undirected graph with 25 vertices x, x3, ..., X25 with edges 


(x;, x;) if and only if integers i and j have a common divisor. How many 
components does this graph have? Find a spanning tree for each component. 

6. Show that in an n-vertex graph, a set of n — 1 edges that form no circuits 
is a spanning tree. 


7. The network below shows the paths to success from L (log cabin) to W 
(White House). The first number is the cost of building a freeway along 


10 
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the edge and the second is the number of trees (in thousands) that would 

have to be cut down. 

(a) Use Kruskal’s algorithm to find a minimal-cost set of freeways connect- 
ing all the vertices together. 

(b) Court action by conservationists rules out use of edges (c, e), (d, f), 
and (k, W). Now find the minimal-cost set of freeways. 

(c) Find two nonadjacent vertices such that the tree in part (a) does not 
contain the cheapest path between them. 

(d) Use Prim’s algorithm to find a set of connecting freeways that mini- 
mizes the number of trees cut down. 


. With reference to Exercise 7, suppose that the set of freeways must include 


city c or city d (but not necessarily both) and all the other cities. Find the 
minimal-cost set of freeways. 


. Find a minimal spanning tree for the network in Figure 4.1 using: 


11. 


12 


13. 


14, 


15. 


16. 


(a) Prim’s algorithm. (b) Kruskal’s algorithm 


With reference to Exercise 7, suppose that the governor’s summer home 
is along edge (f, 7). Find a minimal-cost set of freeways such that (f, /) is 
in that set. 


Find a maximal spanning tree (whose sum of edge lengths is maximal) for 
the network in Figure 3.12. 


Show that a connected undirected graph with just one spanning tree is a 
tree. 


(a) Prove that a depth-first search reaches all vertices in an undirected 
connected graph. 


(b) Repeat part (a) for breadth-first search. 


(a) Show that the height of any depth-first spanning tree of a graph G 
starting from a given root a must be at least as large as the height of 
a breadth-first spanning tree of G with root a. 


(b) By starting with different roots, it might be possible for a depth-first 
spanning tree of G to have a smaller height than a breadth-first spanning 
tree of G. Find a graph in which this can happen. 


A cutset is a set S of edges in a connected graph G whose removal discon- 
nects G, but no proper subset of S disconnects G. Show that any cutset of 
G has at least one edge in common with any spanning tree of G. 


A directed graph is strongly connected if there is a directed path from any 
vertex x to any vertex y (and interchanging roles of x and y, from y to x). 


(a) If a directed graph is not strongly connected, show that its vertices 
can be partitioned into (disjoint) strongly connected components— 
vertices x and y are in the same strongly connected component if and 
only if there exist directed paths from x and y and from y to x. 
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(b) Find the strongly connected components of the directed graph in: 
(i) Figure 1.8a in Section 1.1. 
(ii) The right graph of Figure 1.15 in Section 1.2. 


(c) How many strongly connected components does an m-vertex rooted 
tree have? 


. Let T be a spanning tree in a connected undirected graph G. Show that 


when any non-tree edge is added to T, a unique circuit results. 


. Prove that any spanning tree T’ of a graph G can be converted into any 


other spanning tree T” of G by a sequence of spanning trees T,, To, ..., 
T, where T’ = T,, T” = T,,, and T, is obtained from 7,_, by removing one 
edge of 7,-, that was in T’ and adding one edge in T’ (m is the number 
of edges by which T’ and T” differ). (Hint: Use induction on m.) 


. If each edge has a different cost, show that the minimal spanning tree is 


unique. 


. Modify Prim’s algorithm so that it finds a minimal spanning tree that 


contains a prescribed edge. Prove that your modification works. 


. Modify Kruskal’s algorithm so that it finds a maximal spanning tree. 
. In the proof of the Theorem, show that: 


(a) e& = (a, b) has one end vertex (a or b) in the tree T,_,. 
(b) e* has one of its end vertices in the tree T;,-,. 


. Show that the new T’ (mentioned in the last sentence of the proof of the 


Theorem) with e* replaced by e, is a tree, that is, that T’ is connected and 
circuit-free. 


. Prove that Kruskal’s algorithm gives a minimal spanning tree. 
. Let T be the spanning tree found by Prim’s algorithm in an undirected, 


connected network N. 

(a) Prove that T contains all edges of shortest length in N unless such 
edges include a circuit. 

(b) Prove that if e* = (a, b) is any edge of N not in T and if P is the 
unique path in T from a to b, then for each edge e in P, k(e) = k(e*). 

(c) Prove that part (b) characterizes a minimal spanning tree, that is, any 
spanning tree T (not just those formed by Prim’s algorithm) is a minimal 
spanning tree if and only if part (b) is always true for T. 


. Construct an undirected, connected network with 8 vertices and 15 edges 


that has a minimal spanning tree containing the shortest path between every 
pair of vertices. 


. Suppose that the edges of the undirected, connected network N are ordered 


and that in both Prim’s and Kruskal’s algorithms, when there is a tie for 
the next edge to be added, the smaller indexed edge is chosen. 


(a) Prove that the edges can be ordered so that Prim’s algorithm will yield 
any given minimal spanning tree. 


28. 


29. 


30. 


31. 


32. 


33. 
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(b) Prove that the edges can be ordered so that Kruskal’s algorithm will 
yield any given minimal spanning tree. 
(c) Prove that with ordered edges, both algorithms give the same tree. 


Given an undirected, connected n-vertex graph G with lengths assigned to 
each edge, we form a graph Gy whose vertices correspond to minimal 
spanning trees of N with two vertices v,, v, adjacent if the corresponding 
minimal spanning trees T,, T, differ by one edge, that is, T, = T, — e' + 
e” (for some e’, e”). 

(a) Produce an 8-vertex network N such that Gy is a chordless 4-circuit. 


(b) Prove that if 7, and T, are minimal spanning trees which differ by k 
edges, that is, |T\N7,| = n — k, then in Gy there is a path of length k 
between the corresponding vertices. 


(a) Write a program for building a depth-first spanning tree of a graph 
whose adjacency matrix is given. 


(b) Repeat part (a) for a breadth-first spanning tree. 


Write a computer program to implement (as efficiently as possible): 
(a) Kruskal’s algorithm (b) Prim’s algorithm 


Write a program for the Euler-cycle algorithm in Exercise 17 of Section 
2.1; use a connectedness subroutine to test whether use of an edge discon- 
nects the graph of remaining edges. 


Use a breadth-first inverted spanning tree T (edges directed from children 
to parents) to build an Euler cycle in a directed graph possessing an Euler 
circuit as follows. Starting at the root of T, trace out a path such that at 
any vertex x we choose only the edge in T to the parent of x (in T) if there 
are no other unused edges leaving x at this stage. Verify the correctness 
of the algorithm. Program this algorithm. 


Show that the shortest network linking points a, b, and c in the plane is 
obtained by drawing a fourth point x whose lines to a, b, c form 120° 
angles (as shown) and using these lines to link a, b, c. (If the lines from 
a to b and a to c form an angle of 120°, then x = a; similarly for b or c.) 


a 
120° 
120° 


120° 
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3.4 THE TRAVELING SALESPERSON PROBLEM 


In this section, we illustrate the use of trees in graph optimization problems of 
operations research. The traveling salesperson problem seeks to minimize the 
cost of the route for a salesperson to visit a set of cities and return to home. 
One seeks a minimal-cost Hamilton circuit in a complete graph having an 
associated cost matrix C. Entry c; in C is the cost of using the edge from the 
ith vertex to the jth vertex. The traveling salesperson problem arises in many 
different guises in operations research. One example is planning the movement 
of an automated drill press making holes at specified locations on printed circuit 
boards. 

We present two approaches to solving the traveling salesperson problem. 
Both approaches use trees, but in two very different ways. The first approach 
is to systematically consider all possible ways to build Hamilton circuits in 
search of the cheapest one. It uses a ‘‘branch and bound’’ method to limit the 
number of different vertices in the search tree that must be inspected in search 
of a minimal solution. Each vertex in the tree will represent a partial Hamilton 
circuit, and each leaf a complete Hamilton circuit. Note that a complete graph 
on n vertices has (n — 1)! different Hamilton circuits, for example, a 50-city 
(vertex) problem has 49! ~ 6 X 10® possible circuits. The ‘‘branch and bound”’ 
method reduces substantially the number of circuits that must be checked, 
although not enough to make it practical to solve large traveling salesperson 
problems. 

This leads to another strategy, our second approach, which is to construct 
an near-minimal solution using an algorithm that is quite fast. The huge amount 
of time required to find exact solutions to problems that involve enumeration 
of very large trees, such as the traveling salesperson problem, has led researchers 
in recent years to concentrate on heuristic, near-optimal algorithms for such 
problems. Finding a minimal coloring of any arbitrary graph or testing for 
isomorphism are other problems we have seen that have underlying search 
trees with similarly huge numbers of possible solutions. Near-minimal coloring 
algorithms have received much attention, but unfortunately, a ‘*near-isomor- 
phism’’ is usually of no value. 

We consider a small traveling salesperson problem with four vertices x, 
Xp, X3, X4. Let the cost matrix for this problem be the matrix in Figure 3.14. 
Entry c; is the cost of going from vertex (city) i to vertex j. (Note that we do 


Tol 2 3 4 

From 1 o 3 9 F 

2 3 0 6 5 

3 5 6 6 6 

Figure 3.14 4 9 7 4 @ 
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Tol 2 3 4 

From 1 o 0 6 4 

2 0 » 3 2 

3 0 10 1 

Figure 3.15 4 5 3 0 


not require cj; = c;;). The infinite costs on the main diagonal indicate that we 
cannot use these entries. A Hamilton circuit will use four entries in the cost 
matrix C, one in each row and in each column, such that no proper subset of 
entries (edges) forms a subcircuit. This latter constraint means that if we choose, 
say, entry C23, then we cannot also use cz, for these two entries form a subcircuit 
of length 2. Similarly, if entries cy; and c3, are used, then c,, cannot be used. 

We first show how to obtain a lower bound for the cost of this traveling 
salesperson problem. Since every solution must contain an entry in the first 
row, the edges of a minimal tour will not change if we subtract a constant 
value from the first row of the cost matrix (of course, the cost of a minimal 
tour will change by this constant). We subtract as large a number as possible 
from the first row without making any entry in the row negative, that is, we 
subtract the value of the smallest entry in row 1, namely 3. Do this for the 
other three rows also. We display the altered cost matrix in Figure 3.15. 

All rows in Figure 3.15 now contain a 0 entry. After subtracting a total of 
3 +3 +5 +4 = 15 from the different rows, a minimal tour using the cost 
data in Figure 3.15 will cost 15 less than a minimal tour using original cost 
data in Figure 3.14. Still the edges of a minimal tour for the altered problem 
are the same edges that form a minimal tour for the original problem. 

In a similar fashion, we can subtract a constant from any column without 
changing the set of edges of a minimal tour. Since we will want to avoid 
making any entries negative, we consider subtracting a constant only from 
columns with all current entries positive. The only such column in Figure 3.15 
is column 4, whose smallest value is 1. So we subtract 1 from the last column 
in Figure 3.15 to get the matrix in Figure 3.16. Every row and column in Figure 
3.16 now contains a 0 entry. 

The cost of a minimal tour using Figure 3.16 has been reduced by a total 
of 15 + 1 = 16 from the original cost using Figure 3.14. We can use this 
reduction of cost to obtain a lower bound on the cost of a minimal tour: A 


Tol 2 3 4 
From 1 o 0.6 3 Lower bound = 16 
2 0 ~ 3 #1 
3 0 1 20 =O 
Figure 3.16 4 5 3 0 
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To 1 Do not use C12 


Lower bound = 20 


4 
From 0 
1 
0 
foe} 
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Oo 8 WW Iw 


Aone Bg 


Figure 3.17 


minimal tour using the costs in Figure 3.16 must trivially cost at least 0, and 
hence a minimal tour using Figure 3.14 must cost at least 16. Jn general, the 
lower bound for the traveling salesperson problem equals the sum of the 
constants subtracted from the rows and columns of the original cost matrix to 
obtain a new cost matrix with a 0 entry in each row and column. 

Now we are ready for the branching part of the ‘‘branch and bound”’ 
method. We look at an entry in Figure 3.16 that is equal to 0. Say cy. Either 
we use C, or we do not use Cy. We ‘‘branch’’ on this choice. In the case that 
we do not use cp», we represent the no-c,. choice by setting c;, = ©. The 
smallest value in row 1 of the altered Figure 3.16 is now c,, = 3, and so we 
can subtract this amount from row 1. Similarly we can subtract | from column 
2. If we do not use entry ¢, we obtain the new cost matrix in Figure 3.17. 
Hence, any tour for the original problem (Figure 3.14) that does not use ¢) 
must cost at least 16 + (3 + 1) = 20. 

If we use C, to build a tour for Figure 3.16, then the rest of the tour cannot 
use another entry in row | or column 2; also entry ci. must be set equal to © 
(to avoid a subcircuit of length 2). The new smallest value of the reduced 
matrix in row 2 is 1, and so we subtract 1 from row 2 to obtain the cost matrix 
in Figure 3.18. The lower bound for a tour in Figure 3.18 using cy is now 
16+ 1=17. 

Since our lower bound of 17 is less than the lower bound of 20 when we 
do not use cy», we continue our consideration of tours using cj. by determining 
bounds associated with whether or not to use a second entry with value 0. We 
will continue to extend these partial tours using c,. as long as the lower bound 
is < 20. If the lower bound were to exceed 20, then we would have to go back 
and consider partial tours not using ¢;. Our tree of choices for this problem 
will be a binary tree whose internal vertices represent choices of the form: use 
entry c, or do not use cj (see Figure 3.20). At any stage, as long as the lower 
bounds for partial tours using cj are less than the lower bound for tours not 
using c,, we do not need to look at the subtree of possible tours not using cj. 

We extend a tour using c;. by considering another entry with value 0. This 
next 0 entry need not connect with cy), that is, need not be of the form c;, or 


Tol 3 4 Use ci 
From 2 o 2 O Lower bound = 17 
3 0 © O 
Figure 3.18 4 5 0 @ 
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To 1 3 Use cy and cy 
From 3 0 Lower bound = 17 
Figure 3.19 4 co = 


C2j, but for simplicity we shall pick an entry in row 2. The only 0 entry in row 
2 of Figure 3.18 is cy. 

Again we have the choice of using c,, or not using Cc, Not using Cz will 
increase the lower bound by 2 (the smallest entry in row 2 of Figure 3.18 after 
we set Cy = %; column 4 still has a 0). Using c., will not increase the lower 
bound, and so we further extend the partial tour using c», along with cy). Again 
we delete row 2 and column 4 in Figure 3.18 and set Cig = © (to block the 
subcircuit x,;—x,—x,—x,). Figure 3.19 shows the new remaining cost matrix (all 
rows and columns still have a 0 entry). 

Now the way to finish the tour is clear: use entries c,, and c3, for a complete 
tour X;—X)—X4—X3—x,. We actually have no choice since not using either of c,; 
or C3, forces us to use an © (which represents a forbidden edge). Since cy; = 
C3, = 0, this tour has a cost equal to the lower bound of 17 in Figure 3.19. 
Further, this tour must be minimal since its cost equals our lower bound. 
Rechecking the tour’s cost with the original cost matrix in Figure 3.14, we 
have cy + Cy ten +c) = 3 +5+44+5= 17. 

We summarize the preceding reasoning with the decision tree in Figure 
3.20 summarizing choices we made and their lower bounds (L.B.). 

One general point should be made about how to take best advantage of 
the branch-and-bound technique. At each stage, we should pick as the next 
entry on which to branch (use or do not use the entry), the 0 entry whose 
removal maximizes the increase in the lower bound. In Figure 3.16, a check 


All solutions 


Solutions 
without cy 

L.B. = 20 

Solutions Solutions 

with coq without coq 


L.B.=17 L.B.=19 
Solutions Solutions 
with c43 without c43 
L.B.=17 


No solution 


Figure 3.20 
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of all 0 entries reveals that not using entry cx will raise the lower bound by 3 
+ 3 = 6 (3 is the new smallest value in row 4 and in column 3). So cy, would 
theoretically have been a better entry than c,. to use for the first branching, 
since the greater lower bound for the subtree of tours not using c4; makes it 
less likely that we would ever have to check possible tours in that subtree. 

Now consider the 6 X 6 cost matrix in Figure 3.21. We present the first 
few branchings in Figure 3.22 for this cost matrix, and circle the 0 entry on 
which we branch at each stage. 

The branch-and-bound method is used not only to solve optimization prob- 
lems. It is also a central tool in artificial intelligence. Computers that play chess 
make a tree of possible future moves and then assign some sort of ‘‘value’’ to 
the resulting chessboard situations. A move that leads to a bad situation gets 
a very high value, and future moves from this situation are not pursued. At 
more promising situations, all reasonable next moves are considered and their 
resulting situations evaluated. The more powerful the computer, the more moves 
into the future it can examine, although the procedure for evaluating positions 
is just as important as the machine’s speed. 

Next we present a quicker algorithm for obtaining good (near-minimal) 
tours, when the costs are symmetric (i.e., cy = Cj) and the costs satisfy the 
triangle inequality—cy = cj + Cx. These two assumptions are satisfied in most 
traveling salesperson problems. After describing the algorithm and giving an 
example of its use, we will prove that at worst the quick algorithm’s tour is 
always less than twice the cost of a true minimal tour. A bound of twice the 
true minimum may sound bad, but in many cases where a ‘‘ballpark’’ figure 
is needed (and the exact minimal tour can be computed later if needed), such 
a bound is quite acceptable. Furthermore, in practice our quick algorithm finds 
a tour that is close to the true minimum. 


Quick Traveling Salesperson Tour Construction 


1. Pick any vertex as a starting circuit C, consisting of 1 vertex. 


2. Given the k-vertex circuit C,, k = 1, find the vertex z, not on C;, that is 
closest to a vertex, call it y,, on C,. 


3. Let Cy; be the k+1-vertex circuit obtained by inserting z; immediately in 
front of y, in C,. 


Tol 2 3 4 5 6 
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Figure 3.22 Tree of traveling salesperson partial solutions. 


= 18 


L.B. 
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4. Repeat steps 2 and 3 until a Hamilton circuit (containing all vertices) is 
formed. 


Example 1 


Let us apply the preceding algorithm to the 6-vertex traveling salesperson 
problem whose cost matrix is given in Figure 3.23. Name the vertices x), x2, 
X35 X4, Xs, Xg. We will start with x, as C,. Vertex x, is closest to x,;, and so C; is 
X\-X,-x, . Vertex x; is the vertex not in C, closest to a vertex in C,, namely, 
closest to x4; thus Cy = x,-x3—X;-%1. There are now two vertices, x) and xX, 3 
units from vertices in C3. Suppose we pick x,. It is inserted before x; to obtain 
Cy = X{-Xy-Xj-Xy-X, . Vertex xg is still 3 units from x), and so we insert X¢ 
before x,, obtaining Cs = x;—x)—x3;—Xy—X6—X,. Finally, x5 is within 4 units of x; 
and x, Inserting x; before x, we obtain our near-minimal tour Cy, = 
X|—Xy—-X4—Xy—-Xs—Xp—-X1, whose cost we compute to be 19. 

In this case, the length of this tour obtained with the Quick Tour Construc- 
tion algorithm is quite close to the minimum (which happens to be 18). The 
length of the tour generally depends on the starting vertex. If we suspected 
that our approximate tour was not that close to an optimal length, then by 
irying other vertices as the starting vertex (that comprises C,) and applying the 
algorithm, we will get other near-minimal tours. Taking the shortest of this set 
of tours generated by the Quick Tour Construction would give us an improved 
estimate for the true minimal tour. = 


Theorem 


The cost of the tour generated by the quick construction is less than twice the 
cost of the minimal traveling salesperson tour. 


Proof Optional 


Suppose we are successively building the k-vertex circuits C, according to 
the quick construction in the n-vertex complete graph G. Let S; be a subset of 
edges in the true minimal tour C* that connect the vertices in C, to the other 
vertices in G (S; is described precisely below). When k = 1, we let S, consist 
of C*—e*, where e* is the costliest edge in C*. Since C, is a single vertex 
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+4 x5 


ay 


xg 
(a) (b) (c) 
Figure 3.24 


and C* — e* is a Hamilton path, S, = C* — e* will connect C, with the rest 
of G. 

For concreteness, see the 8-vertex graph in Figure 3.24a, where C, is the 
vertex %, and S; = C* — e* is the Hamilton path of solid edges. If z, = x; 
and so C) is x;~X;~X,, then we set S,) = S,; — (xs, Xs) (C, is shown by dashed 
edges and S, by solid edges in Figure 3.24b). In general, we will obtain S;,,. 
from S, by removing the first edge on the path in S, from C;, to the new vertex 
z, being added to C,. In Figure 3.24c, suppose that z,) = x, and Yr = Xp. SO Xp 
is inserted into C, between x, and x, to obtain C3 = x—xg—x3—X;. Edge (%¢, x7) 
is removed from S, to get S3, since (x, x7) is the first edge on the path in S, 
from C, (specifically x6) to x. 

By the shortest edge rule for picking z, = xg and y, = x, we know that 
Cog [the cost of edge (%, xg)] is the smallest cost among all edges between C, 
(=Xs—X3—Xe) and the rest of G. Thus cg S cg [where (x, x7) was the edge 
removed from S, to get S;]. Using this fact and the triangle inequality, we shall 
now prove that inserting xg into C; to get C; has a net increase in cost = 2cg. 
The increase of C; over C, is Cog + C3g — C3¢, Since edges (x,, xg) and (13, x) 
replace (x3, X¢). But by the triangle inequality, c33 S cy, + Ceg, or equivalently 
C3g — C36 S Cog. Thus Cog + (C3g — C36) S Cog + Cog S 2Ce, as claimed. 

This same argument can be applied when we insert each successive z; into 
C;, to prove that the increase of C,., over C, is at most twice the cost of the 
edge dropped from S, to get S,,,. Starting from C, and repeating this bound on 
the insertion cost, we see that C,, the final near-optimal Hamilton circuit, is 
bounded by twice the sum of the costs of the edges in S, = C* — e*, which 
is less than twice the cost of the minimal traveling salesperson tour C*. @ 


3.4 EXERCISES 
1. Subtract the value of each nondiagonal entry in Figure 3.14 from 10. Now 


solve the traveling salesperson problem for this new cost matrix. 
2. Solve the traveling salesperson problem for the cost matrix in Figure 3.21. 
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3. 


Write a computer program to solve a six-city traveling salesperson problem 
and use it for the cost matrix in Figure 3.23. 


. Use ad hoc arguments to show that the cost of a minimal tour for the cost 


matrix in Figure 3.23 is 18. 


. Every month a plastics plant must make batches of five different types of 


plastic toys. There is a conversion cost c; in switching from the production 
of toy i to toy j, shown in the matrix. Find a sequence of toy production 
(to be followed for many months) that minimizes the sum of the monthly 
conversion costs. 


To T, T, T; Ty Ts 


From 
T; — 3 2 4 3 
Ty 4 — 4 5 6 
T 5 3 — 4 4 
T, 3 5 1 — 6 
Ts 5 4 2 3 — 


The assignment problem is a matching problem with n people and n jobs 
and a cost matrix with entry c, representing the ‘‘cost’’ of assigning person 
i to job j. The goal is a one-to-one matching of people to jobs that minimizes 
the sum of the costs. Set all diagonal entries in Figure 3.14 equal to 5 and 
solve this 4 X 4 assignment problem by a branch-and-bound approach. How 
does an assignment problem differ from a traveling salesperson problem? 


. Use the quick construction to find approximate traveling salesperson tours 


for the cost matrices in (use just entries above the main diagonal): 
(a) Figure 3.14 (b) Figure 3.21 (c) Exercise 5 


. Consider the following rule for building approximate traveling salesperson 


tours. Starting with a single-vertex tour T,, successively add the vertex 
whose insertion into T; to form T;,.; minimizes the increase in cost, that 
is, if x, is inserted between x, and x,+1 then Cy + Cras) — Che+ should be 
minimal over all choices of x, and x;. 


(a) Can you prove an upper bound on this method similar to the one found 
for the quick construction method (making the same assumptions)? 

(b) Apply this method to the cost matrix in (i) Figure 3.21 (using just 
entries above the main diagonal) and (ii) Figure 3.23. 


. Find a 3 X 3 cost matrix for which two different initial lower bounds can 


be obtained (with different sets of 0 entries) by subtracting from the rows 
and columns in different orders. 
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10. Make up a 5 X 5 cost matrix for which the Quick Tour Construction finds: 
(a) An optimal tour. 
(b) A fairly costly tour (at least 50% over the true minimum), 


—eees ho 


3.5 TREE ANALYSIS OF SORTING 
ALGORITHMS 


One of the basic combinatorial procedures in computer science is sorting a set 
of items. Items are sorted according to their own numerical value or the value 
of some associated variable. To simplify our discussion, we will assume that 
all items have distinct numerical values. 

Some of the sorting procedures we mention will use binary trees explicitly. 
All the procedures use binary testing trees implicitly, that is, the procedures 
use a sequence of tests that compare various pairs of items. A binary-testing 
tree model of sorting has one important consequence. The tree must have at 
least n! leaves if there are n items, since there are n! possible permutations, 
or rearrangements, of the set. By Theorem 3 in Section 3.1, the height of 
a binary testing tree must be at least [log,n!], which is approximately 
n logon. 


Theorem 


In the worst case, the number of binary comparisons required to sort n items 
is at least O(n logon). 


The theorem refers to the worst case, because certain sorting algorithms 
will sort some sets very quickly (corresponding to short paths in the binary 
testing tree). A function g(n) is said to be of order O(f(n)) if, for large values 
of n, g(n) = cf(n), for some constant c. One can also show that the average 
number of binary comparisons required to sort n items is at best of order O(n 
log»n) (see Exercise 3). 

The best known sorting algorithm is called bubble sort, so named because 
small items move up the list the way bubbles rise in a liquid. It is compactly 
written as: 


FOR M <— 2 To nDbDo 
FOR ic— N STEP —1 TO MDO 
IF A; < Aj_1, THEN interchange items A, and Ai; 


This procedure will always require (n — 1) + (n — Qyr.e.t1s= 
3n(n — 1) binary comparisons. Thus this procedure requires O(n) comparisons, 
as opposed to the theoretical bound of O(n log,n) comparisons. To see how 
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much faster O(n?) grows than O(n log)n), observe that for n = 50, n* = 2500, 
and n log,n ~ 300; and for n = 500, rn? = 250,000, and n logon ~ 4500. 

The simplest-to-state sorting procedure that achieves the O(n log,n) bound 
is merge sort. It recursively subdivides the original list and successive sublists 
in half (or as close to half as possible) until each sublist consists of one item. 
Then it successively merges the sublists in numerical order. The subdivision 
process is naturally represented as a balanced binary tree and the merging as 
a reflected image of the tree. 


Example 1 


Sort the list 5, 4, 0, 9, 2, 6, 7, 1, 3, 8 using a merge sort. 
The subdivision tree and subsequent ordered merges are shown in Figure 
3.25. = 


To analyze the number of binary comparisons in a merge sort, we make 
the simplifying assumption that n = 2’, for some integer r. Then the subdivision 
tree will have sublists of size 2’~! at the level-1 vertices. In general, there will 
be 2'~* items in the sublists at level k. At level r, there are leaves each with 
one item. 

In the merging tree, pairs of leaves are ordered at each vertex on level 
r — 1; this requires one binary comparison (to see which leaf item goes first). 
In general, at each vertex on level k we merge the two ordered sublists (of 
2’! items) of the 2 children into an ordered sublist of 2'-* items; this merging 
will require 2’/* — 1 binary comparisons (verification of this number is left as 


Figure 3.25 1, 2,3, 4,5, 6,7, 8,9 
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an exercise). Finally, the two ordered sublists of 2’"! items at level 1 are merged 
at the root. The number of binary comparisons at all the vertices on level k is 
2'(2"* — 1) since there are 2‘ different vertices on level k. Summing over all 
levels, we compute the total number of binary comparisons 


r-1 r-1 


By 2k2r-k -1)= > (2" — 2") 


=0 k=0 
r-1 r-t 
es >, or — > 2k 
k=0 k=0 
= r2'— (2’— 1) = (logyn)n — (n— 1) 


since n = 2’ and hence r = log)n. Thus the number of binary comparisons in 
a merge sort is O(n logyn). However, extra computer time is required to imple- 
ment the initial subdivision process. This extra work also requires only O(n 
log,n) steps. When n = 15, a bubble sort usually takes less time (and a lot less 
programming effort) than a merge sort. 

A closely related sorting procedure is called QUIK sort. In QUIK sort, 
one takes the first item A, in the list L and uses it to divide the rest of the list 
into sublists Z; and L, such that all items in L, are less than A, and all items 
in L, are greater than A,. This subdivision would require n — 1 comparisons 
(of each item against A,). Item A, is then put at the end of sublist L,. Next the 
same subdivision procedure is used with the first item in each of the two sublists 
(L, is divided into L,, and L,), and so on. Finally all sublists have only one 
item, but the sublists are ordered and by concatenating the sublists in order we 
obtain a sorted list. 

This procedure is simpler to implement than a merge sort, but it may not 
divide the lists evenly (for example, A, could be the largest item so that L, is 
empty). In the worst case, QUIK sort can be shown to require as many compari- 
sons as a bubble sort and hence is an O(n’) algorithm. But when QUIK sort 
is used with many randomly arranged lists, the average number of comparisons 
is only O(n log)n). 

We conclude this section with a more complicated tree-based sorting proce- 
dure called heap sort. A heap is a binary, or almost-binary (some internal 
vertices may have just one child) tree with each internal vertex’s value being 
numerically greater than the values of its children. Figure 3.26a shows a heap 
involving the numbers 0 through 9. The root of the heap must have the largest 
value in the set. If we remove the root of the heap (and place it at the end of 
the ordered list we are building), then we can reestablish a heap by making 
the larger child of the root the new root and recursively replacing each internal 
vertex moved up with the larger of its two children. The arrows in Figure 3.26a 
Show how vertices move up when the root 9 is removed. The new heap is 
shown in Figure 3.26b. Now we again remove the root of the heap (and make 
it the next-to-last item in the sorted list). Repeat this procedure until the heap 
is emptied and the sorted list complete. The one important missing step is 
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ta ra 
1] : if ‘ / sy rs x, 
wO\ \ 


Figure 3.26 (a) (b) 


creating the initial heap. This problem and further discussion of heap sort is 
left for the exercises. 


3.5 EXERCISES 


Mb Summary of Exercises The first exercises are based on the 


sorting methods presented in this section. The remaining exercises present other 
sorting schemes. 


1. 


Apply merge sort and QUIK sort to the following sequences: 
(a) 6, 9, 5, 0, 3, 1, 8, 4, 2,7 
(b) 15, 27, 4, —7, 9, 13, 8, 28, 12, 20, —80 


. Show that at most n — 1 comparisons are needed to merge two sorted 


sublists into a single sorted list of n items. 


_ Use the result of Exercise 13 in Section 3.1 to show that the average number 


of binary comparisons required to sort 7 items is at least O(n log, n) 


. (a) Find a way to represent QUIK’s sublist divisions with a binary tree. 


(b) Draw this tree for a QUIK sort of the list in Exercise 1(a). 


. Give an example of an n-item list for which QUIK sort requires 3n(n — 


1) comparisons. 


. (a) Describe how to build an initial heap from an unordered list of n items 


(the initial heap should be a balanced tree). 


(b) Use your method in part (a) to make a heap for the lists in Exer- 
cise 1. 


(c) Apply heap sort to the heaps in part (b). 


. Show that heap sort requires O(n log,n) comparisons to sort 7 items (this 


includes the initial construction of a heap). 


. Modify the following sorting methods to allow for repeated (two or more 


equal) items: 
(a) Bubble sort (b) Merge sort (c) QUIK sort (d) Heap sort 
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9. Write a computer implementation of (use a recursive language such as 
PASCAL): 


(a) Merge sort (b) QUIK sort (c) Heap sort 


10. Consider the following sorting scheme, which we call tree sort. We build 
a ‘‘dictionary look-up’’ binary tree recursively. The first item in the list to 
be sorted is the root. The second item is a left or right child of the root, 
depending on whether it numerically precedes or follows the root. We 
continue to add each successive item as a leaf to this growing tree. For 
example, if a list begins 8, 4, 11, 6, ..., the tree after four items would 
be 


4 11 


6 


When all items have been incorporated into the tree, the sorted order is 
obtained by an inorder traversal of the tree (inorder traversals were defined 
in Section 3.2). 


(a) Build the tree for the list in Exercise 1(a). 

(b) Compare the tree in part (a) with the tree in Exercise 4(b) for this list. 

(c) Generalize part (b) to show that tree sort is equivalent to QUIK sort. 
11. Consider the following sorting scheme for the list A;, Ay,..., A, (n = 2’). 

First do a sort (one comparison) of A; and A;.,;; call this the ith ordered 

pair, fori = 1, 2,..., n/2. Next do a merge sort of the ith and the (i+ 

n/4)th ordered pairs; call this the ith 4-tuple. Next do a merge sort of the 


ith and the (i + n/8)th ordered 4-tuples. Continue until there is just one 
sorted list of all n items. 


(a) Apply this sorting method to the list 10, 12, 7, 0, 5, 8, 11, 15, 1, 6, 
3, 9, 13, 4, 2, 14. 


(b) How many comparisons does this sorting method require for a list of 
n numbers? 


—$_____y 


3.6 SUMMARY AND REFERENCES 


his. chapter examined a variety of search and data organization problems. The 
common graph-theoretic tool for all these problems was trees. Section 3.1 
presented basic properties and terminology of trees. Section 3.2 demonstrated 
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the uses of depth-first and breadth-first search. Section 3.3 introduced spanning 
trees and minimal spanning trees. Section 3.4 gave tree-based branch-and- 
bound and heuristic approaches to a famous optimization problem, the traveling 
salesperson problem. Section 3.5 looked at the decision trees underlying sorting 
algorithms. 

The first paper implicitly using trees was Kirchhoff’s 1847 fundamental 
paper about electrical networks. Cayley was the first person to use the term 
tree in an 1857 formula for counting ordered trees (Theorem 4 in Section 3.1). 
Searching methods have been around for years (see Lucas [3]), but a systematic 
development of this subject came only in recent years with the advent of digital 
computers. There are several good computer science books about searching, 
sorting, and graph algorithms. See Aho, Hopcroft, and Ullman [1], or Sedgewick 
[4]. For a good survey of the Traveling Salesperson Problem, see Lawler et 
al. [2]. 


1. A. Aho, J. Hopcroft, and J. Ullman, Data Structures and Algorithms, 
Addison-Wesley, Reading, MA, 1983. 


2. E. Lawler, J. Lenstra, A. Rinnooy Kan, and D. Shmoys, The Traveling 
Salesman Problem, John Wiley, New York, 1985. 


3. E. Lucas, Recreations Mathematiques, Gauthier-Villars, Paris, 1891. 
4. Sedgewick, R., Algorithms in C, Addison-Wesley, Reading MA, 1990. 


CHAPTER 4 
NETWORK ALGORITHMS 


4.1 SHORTEST PATHS 


In this chapter we present algorithms for the solution of two important network 
optimization problems. By a network we mean a graph with a positive integer 
k(e) assigned to each edge e. This integer will typically represent the “‘length’’ 
of an edge, in units such as miles, or represent ‘‘capacity’’ of an edge, in units 
such as megawatts or gallons per minute. The optimization problems we shall 
discuss arise in hundreds of different guises in management science and system 
analysis settings. Thus good systematic procedures for their solution are essen- 
tial. In the case of network flows, we shall see that the flow optimization 
algorithm can also be used to prove several well-known combinatorial theorems. 
The minimal spanning tree problem, encountered in Section 3.3. is another 
important network optimization problem. 

We begin with an algorithm for a relatively simple problem, finding a 
shortest path in a network from point a to point z. We say a shortest path 
because, in general, there may be more than one shortest path from a to z. For 
the rest of this section, let us assume that all networks are undirected and 
connected. 

Let us immediately eliminate one possible shortest path algorithm: Deter- 
mine the lengths of all paths from a to z, and choose a shortest one. The 
computer is fast, but not that fast—such enumeration is already infeasible for 
most networks with 20 vertices. So when we find a shortest path, we must be 
able to prove it is shortest without explicitly comparing it with all other a—z 
paths. Although the problem is now starting to sound difficult, there still is a 
straightforward algorithmic solution. 

The algorithm we present is due to Dijkstra. This algorithm gives shortest 
paths from a given vertex a to all other vertices. Let k(e) denote the length of 
edge e. Let the variable m be a ‘‘distance counter.’’ For increasing values of 
m, the algorithm labels vertices whose minimal distance from vertex a is m. 
The first label of a vertex x will be the previous vertex on the shortest path 
from a to x. The second label of x will be the length of the shortest path from 


a to x. 
133 | 
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Shortest Path Algorithm 


1. Set m = 1 and label vertex a with (-, 0) (the ‘‘—’’ represents a blank). 


2. Check each edge e = (p, ¢) from some labeled vertex p to some unlabeled 
vertex g. Suppose p’s labels are [r, d(p)]. If d(p) + k(e) = m, label g 
with (p, m) 

3. If all vertices are not yet labeled, increment m by 1 and go to Step Zi 
Otherwise go to Step 4. If we are only interested in a shortest path to z, 
then we go to Step 4 when z is labeled. 


4. For any vertex y, a shortest path from a to y has length d(y), the second 
label of y. Such a path may be found by backtracking from y (using the 
first labels) as described below. 


Observe that instead of concentrating on the distances to specific vertices, 
this algorithm solves the questions: How far can we get in 1 unit, how far in 
2 units, in 3 units, ..., in m units, .. .? Formal verification of this algorithm 
requires an induction proof (based on the number of labeled vertices). 

The key idea is that to find a shortest path from a to any other vertex we 
must first find shortest paths from a to the “‘intervening’’ vertices. If P; = (s), 
59, ..., 8;) is a shortest path from s, to s,, then P; = P;-; + (s:-1, 8;), where 
P;-; = (51, 5, .--, 5j-1) iS a Shortest path to 5;-1. Similarly P;-,; = P:-2 + (Si-, 
5;-1), and so on. 

To record a shortest path to s;, all we need to store (as the first part of a 
label in the above algorithm) is the name of the next-to-last vertex on P,, 
namely, s;-;. Preceding s;-, on the path is s;-», the next-to-last vertex on P;-;. 
By continuing this backtracking process, we can recover all of P;. 

The algorithm given above has one significant inefficiency: If all sums 
d(p) + k(e) in Step 2 have values of at least m' > m, then the distance counter 
m should be increased immediately to m’. 


Example 1 


A newly married couple, upon finding that they are incompatible, want to find 
a shortest path from point N (Niagara Falls) to point R (Reno) in the road 
network shown in Figure 4.1. We apply the shortest path algorithm. First N is 
labeled (-, 0). For m = 1, no new labeling can be done [we check edges (N, 
b), (N, d), and (N, f)]. For m = 2, d(N) + KN, b) = 0 + 2 = 2, and we label 
b with (N, 2). For m = 3, 4, no new labeling can be done. For m = 5, d(b) 
+ k(b, c) = 2 + 3 = 5. So we label c with (6, 5). We continue to obtain the 
labeling shown in Figure 4.1. Backtracking from R, we find the shortest path 
to be N—b—c—d—h—-k-j—m-R with length 24. = 


If we want simultaneously to find the shortest distances between all pairs 
of vertices (without directly finding all the associated shortest paths), we can 
use the following simple algorithm due to Floyd. Let matrix D have entry 
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F(N, 10) 


BIN, 2) 


Figure 4.1 


d; = © (or a very large number) if there is no edge from the ith vertex to the 
jth vertex; otherwise d, is the length of the edge from x; to x;. Then Floyd’s 
algorithm is most easily stated with a computer program: 


FOR kK <—17T0NDO 
FoR if <-1T0NDO 
FoR / <1T0NbDO 
IF dx oe dy < dj; THEN dj — dix 2 dij; 


When finished, d; will be the shortest distance from the ith vertex to the 
jth vertex. 


4.1 EXERCISES 


Mb Summary of Exercises The first six exercises involve shortest 


path calculations. The remaining exercises discuss associated theory (fairly 
easy theory), and the last exercise asks for a program. 


1. Use the shortest path algorithm to find the shortest path between vertex c 
and vertex m in Figure 4.1. 


2. Find the shortest path between the following pairs of vertices in the network 
in Figure 3.12 in Section 3.3. 


(a) a and y (b) d andr (c) e and g 

3. The network below shows the paths to success from L (log cabin) to W 
(White House). The first number on an edge is the time (number of years) 
it takes to traverse the edge; the second is the number of enemies you 


make in taking that edge. Use the shortest path algorithm to answer the 
following: 


(a) Find the quickest path to success (from L to W). 
(b) Find the quickest path to success that avoids points c, g, and k. 
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(c) Find the path to success (from L to W) that minimizes the total number 
of enemies you make. 

(d) Alter the order in which edges are checked in Step 2 of the shortest 
path algorithm to get another minimal-enemies path. How many such 
minimal-enemies paths are there? 


. With reference to Exercise 3, let the roughness index R of a path to success 


be R = T + 2E, where T is the time to get from L to W and E is the total 
number of enemies made. 


(a) Find the smoothest (least rough) path to success. 


(b) Find the smoothest path to success that includes edge (f, i); this edge 
can be traversed in either direction. 


. Ignore the numbers on the edges in Exercise 3 and use the shortest path 


algorithm to find the following shortest (least edges) paths: 

(a) Shortest path from L to W. 

(b) Shortest path from L to W including vertex d. 

(c) Shortest path from L to W including both vertex e and vertex m. 


. Suppose that the edges in Exercise 3 are directed according to the alphabeti- 


cal order of the endpoints, where L precedes a and W follows m [so edge 
(r; s) goes from r to s if r precedes s in the alphabet]. Find the quickest 
path from L to W in this directed network. 


. Prove by induction on m that Dijkstra’s shortest path algorithm finds the 


shortest path from a to every other vertex in the network. 


. Prove that Floyd’s algorithm finds the shortest path between all pairs of 


vertices. 


. Make up an example to show that Dijkstra’s algorithm fails if negative 


edge lengths are allowed. 


. Alter the shortest path algorithm so that in Step 2 we search among all 


edges from a labeled vertex p to an unlabeled vertex g for an edge that 
minimizes d(p) + k(p, g). The counter m is eliminated. Supply the rest 
of the details and show that this new algorithm is equivalent to the original. 


. Show that in the shortest path algorithm the edges used in Step 2 to label 


new vertices form a spanning tree. 
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12, (a) Show that if the edges are properly ordered (and the edges are checked 
in this order in Step 2), the shortest path algorithm will produce any 
given shortest path from a to z when more than one shortest path 
exists. 


(b) Order the edges of the figure in Exercise 3 so that in Exercise 3(c), 
the ‘‘shortest’’ path found will be (L, a, c, d, f, g, k, m, W). 


13. Consider the following physical model of a shortest path problem. Make 
pieces of string of length proportional to the length of each edge [e.g., an 
edge e with k(e) = 7 is a 7-in. string]. Tie the string edges together at the 
appropriate nodes (vertices). Now take the start and end vertex of the 
network in your left and right hands, respectively, and pull them as far 
apart as possible. Show that any sequence of string-edges that form a taut 
path from a to z constitutes a desired shortest path. 


14. Show that with the ‘‘shortcut’’ in Exercise 10, the shortest path algorithm 
requires at most a number of steps proportional to the number of vertices 
times the number of edges in the network. 


15. The transitive closure of a directed graph G is obtained by adding to G 
an edge (x;; x;) for each nonadjacent pair x;, x, with a directed path from 
to x; to x; Let d; = 1 if (x;; x,) is an edge in G and = 0 otherwise. Replace 
the IF statement in Floyd’s algorithm with 


IF Ae + Aj > Aj THEN dy <— 1 


Show that this revised Floyd’s algorithm finds the transitive closure 
of G. 


16. Write a computer program implementing Dijkstra’s shortest path algorithm 
using the shortcut in Exercise 10. 


—=e 


4.22 NETWORK FLOWS 


In this section we interpret the integer k(e) associated with edge e in a network 
as a Capacity or upper bound. We seek to maximize a ‘‘flow’’ from vertex a 
to vertex z such that the flow in each edge does not exceed that edge’s capacity. 
Many transport problems are of this general form, for example, maximizing 
the flow of oil from Houston to New York through a large pipeline network 
(here the capacity of an edge represents the capacity in barrels per minute of 
a section of pipeline), or maximizing the number of telephone calls possible 
between New York and Los Angeles through a long-distance telephone network. 
It is convenient to assume initially that all networks are directed. 
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Let In(x) and Out(x) be the sets of edges directed into and out from vertex 
x, respectively. Then we define an a—z flow ¢ in a directed network N to be 
an integer-valued function g defined on each edge e—g(e) is the flow in e— 
together with a source vertex a and a sink vertex z satisfying the following 
three conditions: 


(a) 0 = gfe) = ke) 

(b) g(e) = O if e € In(a) ore € esas 

(ec) For x # a or z, SY of =k ge). 
eE€In(x) e€Out(x) 

The second condition assures that the flow goes from a to z, not in the 
reverse direction. A sample flow is shown in Figure 4.2; the capacity and flow 
in each edge e are written beside the edge: k(e), y(e). The assumption of integer 
capacities and flows is not restrictive, that is, the units we count could be 
thousandths of an ounce rather than barrels. 

Let (P, P) denote the set of all edges (x; y) with vertex x € P and y € 
P) (where P denotes the complement of P). We call such a set (P, P) 
a cut. The cut ({a, b, c}, {d, e, z}) in Figure 4.2 consists of the edges 
(b; d), (b; e), (c; e); the edge (d> c) is not in the cut because it goes from P 
to P. This cut is represented in Figure 4.2 by a dashed line that separates the 
vertices in P = {a, b, c} from the vertices in P; the edges in the cut are the 
edges crossing the dashed line from left to right. We call (P, P) an a—z cut if 
a€ PandzeP. 

Let @ be an a—z flow in the network N and let P be a subset of vertices 
in N not containing a or z. Summing together the conservation-of-flow equations 
in condition (c) for each x € P, we obtain 

> » H)=> >» ve) 
x€EP e€In(x) xEP e€Out(x) 

Certain ¢g(e) terms occur on both sides of the preceding equality; namely, 
edges from one vertex in P to another vertex in P. After eliminating such ¢(e) 
from both sides, the left side becomes ice RK g(e) and the right side Das 
g(e). Thus we have: 


(c’) For each vertex subset P not containing a or z, 


S p2= dX ge) 


e€(P,P) e€(P,P) 


Figure 4.2 
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That is, the flow into P equals the flow out of P. The following intuitive 
result is readily verified with (c’). 


Theorem 1 
For any a—z flow ¢ in a network N, the flow out of a equals the flow into z. 


Proof 


Assume temporarily that N contains no edge (a, z). Let P be all vertices in N 
except a and z. So P = {a, z}. Then (P, P) consists of edges going into z 
(from P) and (P, P) consists of edges going out of a (to P). Thus 
by (c'} 
flow out of a = > ge) = >. ye) = flow into z 
e€(P,P) e€(P,P) 

Remember that the only flow into P from {a, z} must be from a, since 
condition (b) forbids flow from z. Similarly by condition (b), all flow out of 
P must go to z. The flow equality still holds if there is flow in an edge 
(a, z). 


We let |g| denote the value of the a—z flow ¢. This value equals the sum 
of the flow out of a, or equivalently by Theorem 1, the flow into z. Let us 
consider the question of how large |g| can be. One obvious upper bound is the 
sum of the capacities of the edges leaving a, since 


lo|= > ges > Ke) 
e€Out(a) e€Out(a) 

Similarly, the sum of the capacities of the edges entering z is an upper 
bound for |g]. Intuitively, |g| is bounded by the sum of the capacities of any 
set of edges that cut all flow from a to z. We define the capacity k(P, P) of 
cut (P, P) to be 


kK(P,P)= >) kee) 


e& (PP) 


The capacity of the a—z cut (P, P), where P = {a, b, c}, in Figure 4.2 is 13. 
This tells us that no a—z flow in the network in Figure 4.2 can have a value 
greater than 13. 


Theorem 2 
For any a—z flow g and any a—z cut (P, P) in a network N, lo| < KP, P). 


Proof 


Informally, |g| should equal the total flow from P to P [which is bounded by 
k(P, P)], but we cannot prove an equality of this type using condition (c’) 
without first modifying N. We cannot use (c’) in N because condition (c’) 
requires that P not contain the source a. 
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Expand the network N by adding a new vertex a’ with an edge e’ = (a’} a) 
of immense capacity. Note that a’ will be part of P. Assign a flow value of |g] 
to e’, yielding a valid a’—z flow in the expanded network (see Figure 4.3). In 
effect, the old source a now gets its flow from the ‘‘super source’’ a’. In the 
new network, we can apply condition (c’) to P. It says that the flow into P, 
which is at least |g] (other flow could come into P along edges from P), equals 
the flow out of P. Thus 


M= > @©)"| d gos Dd Mey =KP,P)o (*) 


e€ (P,P) e&(P,P) e€ (P,P) 


Corollary 2a 
For any a—z flow g and any a—z cut (P, P) in a network N, lol = KP, P) if 
and only if: 


(i) For each edge e € (P, P), ge) = 0. 
(ii) For each edge e & (P, P), g(e) = ke). 


Further, when |g| = k(P, P), v is a maximal flow and (P, P) is an a—z cut of 
minimal capacity. 


Proof 


Consider the two inequalities in (*) of the preceding proof. The first inequality 
is an equality—the flow from P into P (in the expanded network) equals |g|— 
if condition (i) holds; otherwise the flow into P is greater than |g|. The second 
inequality is an equality—the flow out of P equals k(P, P )—if condition (ii) 
holds; otherwise it is less. Thus equality holds in (*) if and only if conditions 
(i) and (ii) are both true. The last sentence in the corollary follows directly 
from Theorem 2. 


Now that we have developed all the concepts needed to present our flow 
maximizing algorithm, we next discuss an intuitive but faulty technique that 
can sometimes be used as a shortcut in place of the correct algorithm. After 
the fault in the shortcut is exposed, the algorithm can be more easily understood 
and appreciated. 

All normal flows can be decomposed into a sum of unit-flow paths from 
a to z, for short, a—z unit flows (abnormal flows that cannot be so decomposed 
are discussed in Exercise 23). For example, in a telephone network, the flow 
from New York to Los Angeles can be decomposed into individual telephone 


Figure 4.3 
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calls. Similarly, flow of oil in a pipeline network can be decomposed into the 
paths of each individual petroleum molecule. Formally, an a—z unit flow ¢, 
along a—z path L is defined as ¢,(e) = 1 if e is in L and = 0 if e is not in L. 

If 2, GQ, ..., Gy are a—z unit flows, then g = g + @ +--+ + Gm Will 
satisfy conditions (b) and (c) in the definition of a flow (given at the beginning 
of this section) since $), g, ..., Qn Satisfy these conditions. If in addition, g 
satisfies condition (a)—¢(e) = k(e), for all e—then ¢ is a valid flow. 

This suggests a way to build a maximal flow. We build up the flow as 
much as possible by successively adding a—z unit flows together, always being 
sure not to exceed any edge’s capacity. An additional unit flow can use only 
unsaturated edges, edges where the present flow does not equal the capacity. 
So we must build paths consisting of unsaturated edges. We define the slack 
S(e) of edge e in flow ¢ by 


s(e) = ke) — ge) 


If s is the minimum slack among edges in the a—z unit flow g, then we can 
use an additional flow along L of sg, = gy + o& +o + +++ + ©, (s times). 


Example 1 


Let us use the method just outlined to build a maximal a—z flow for the network 
in Figure 4.4a. Note, as an upper bound, that the value of a flow cannot exceed 
10, the capacity of edges going out of a. 

We start with no flow, that is, p(e) = 0, for all e. Now we find some path 
from a to z, for example, the a—z path L, = a—b—d-z. The minimum slack on 
L, is 3 (at the start, the slack of each edge is just its capacity). So to our initial 
zero flow, we add the flow 3¢,. All edges except (b; d) are still unsaturated. 

Suppose that we next find the a—z path L, = a—c—e-z, also with minimum 
slack 3. Our current flow is 3g, + 3¢,,, aS shown in Figure 4.4b. The path 
L; = a—b—e—z with minimum slack 2 can be used to get the augmenting flow 
2¢1,. Figure 4.4c shows the remaining unsaturated edges with their slacks. 

The only a—z path in Figure 4.4c is L, = a—c—d—z with minimum slack 
1. After adding the flow ¢,, (see Figure 4.4d), we can get no further than from 
a to c. Observe that we have saturated the edges in the a—z cut (P, P), where 
P = {a, c}. The value of the final flow, 9, equals the capacity k(P, P) of this 
cut, and so by Corollary 2a the flow must be maximal. = 


Example 2 


Suppose the network in Figure 4.4a is redrawn as in Figure 4.4e, with the 
positions of d and e switched. 

Let us again choose augmenting a~z flow paths across the top and bottom 
of the network, now having sizes 5 and 1, respectively. Since edges (a; b) and 
(c; d) are saturated by these flow paths, the only possible a—z path along 
unsaturated edges is L; = a—c—e-z with minimum slack 1 [the minimum 
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Slack in unsaturated edges for 3¢,, + 31, + 2,, 3g, + 3@1,, + 241, + G1, 


¢ and saturated cut (P,, P,) with P, = {a,c,e} | Corrected maximal flow 


Figure 4.4 


occurring in edge (e, z)]. After adding the a—z unit flow g,,, we get the flow 
¢% Shown in Figure 4.4e. The cut (Pp, Po), where Py = {a, c, e}, is saturated, 
and so no more augmenting a—z unit flows exist. Yet |g| = 7 and k(Pp, Po) = 
12. Remember that a flow of size 9 was obtained for this same network in 
Example 1! What has happened? 

We now see that an arbitrary sequence of successive augmenting a—z unit 
flows need not inevitably yield a maximal flow. We are also faced with a flow 
g and a saturated a—z cut (Po, Po) such that |go| < k(Po, Po). Corollary 2a 
implies that there must be some flow in an edge e € (Po, P,). Looking at Figure 
4.4e, we see that the flow path L’ = a—b—e—z crosses the a—z cut (Po, Po) 
backwards (from P to Py) on the edge (b; e); or equivalently, L' crosses the 
cut forward twice, on edge (a; ) and again on (e; z). Thus the 5 units of flow 
along L’ use up 10 units of capacity in the cut, whence k(Po, Po) is 5 units 
greater than |g. 
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The reason that the sequence of augmenting flow paths in this example 
did not lead to a maximal flow can be explained intuitively as follows. By 
sending 5 units of flow along L’ (see Figure 4.4e), we have routed all the flow 
passing through b on to e and none of it to d. Then only 1 unit of flow passing 
through c can be routed on to e and then along edge (e; z). But much of the 
flow through c must go to e, since the capacity of (c; d) is only 1. In sum, the 
initial 5-unit flow along a—b—e—z was a ‘‘mistake’’ because some of the capacity 
in edge (e; z) should have been ‘‘reserved’’ for flow from c. m= 


How can we avoid or correct such mistakes? If we understood where the 
mistakes were made, we could change some of the flow paths and try a new 
sequence of augmenting flow paths. However, we are likely to make other 
mistakes in subsequent constructions. Indeed, there may be certain networks 
in which it is impossible not to make such a mistake, no matter what sequence 
of flow paths is used. In terms of cuts, we may always end with a saturated 
a-z cut that one of our flow paths crosses twice. 

Fortunately, there is a procedure to correct ‘‘mistakes’’ and thereby further 
increase the flow. The method will not look for edges that must be ‘‘reserved’’ 
for later flow paths, as suggested above (that is too hard a problem). Rather, 
it looks for flow that is going the wrong way (backwards) across an a—z cut. 
Then it finds a way to decrease the backward flow without changing the forward 
flow across the cut, the result being more total flow across the cut (and through 
the network). The following example presents the idea behind this procedure. 


Example 2 (continued) 


The edge (b> e) contains 5 units of flow that is going backward across the 
saturated cut (Py, Po), where Py = {a, c, e}. By how much can the flow in 
(b; e) be reduced? 

Condition (c) of a flow—flow in equals flow out—requires that a reduction 
of the flow into e from b must be compensated by an increase to e from 
elsewhere in Py (if the compensating flow comes from Py) we would have a 
new backward flow). Such an increase must in the end come from a. Thus, we 
need an a—e flow path in Py. The only such path is K, = a—c—e. Similarly, a 
reduction of the flow out of b to e must be compensated by an increase in flow 
out of b to somewhere else in Py. So we need a b—z flow path in P,. The only 
such path is K, = b—d-e. 

The minimum slack along K, is 2 and the minimum slack along K, is 3. 
Then we can decrease the flow in (bj e) by 2 while increasing the flow in K, 
and K, by 2. Figure 4.4f shows the resulting maximal flow. Note that this new 
maximal flow is different from the maximal flow in Figure 4.4d, although both 
saturate the a—z cut ({a, c}, {b, d, e, z}). = 


A chain in a directed graph is a sequence of edges that forms a path when 
the direction of the edges is ignored. A unit-flow chain from a to z along the 
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a-z chain K is a ‘‘flow’’ g x with a value of 1 in each edge of K forwardly 
directed, a value of —1 in each edge of K backwardly directed, and a value of 
0 elsewhere. Note that g is not really a flow because it can assume a negative 
value on some edges. However, if g is a flow that already has positive values 
in each backwardly directed edge of K and has slack in each forwardly directed 
edge of K, then o + gx is a flow. 

The flow correction made in the continuation of Example 2 consisted of 
a (2-unit) a—z flow chain along chain K = a—c—e—b—d—z. When no backwardly 
directed edges occur in a flow chain, then it is simply a flow path. We shall 
see that any sequence of augmenting a—z flow chains leads to a maximal flow. 
Flow chains are the appropriate generalization of flow paths that both build 
additional flow and simultaneously correct possible ‘‘mistakes.”’ 

We now present a flow chain algorithm to increase the value of a flow. 
The algorithm is designed so that if it fails to obtain an augmenting a—z flow 
chain, it will produce a saturated a—z cut whose capacity equals the value |¢| 
of the current flow ¢. Thus by Corollary 2a, g¢ would be maximal. 

The algorithm recursively tries to build augmenting flow chains from a to 
all vertices in a manner reminiscent of the shortest path algorithm in Section 
4.1. The algorithm assigns two labels to a vertex g: (p~, A(q)), where p is the 
previous vertex on a flow chain from a to q, the superscript of p is + if the 
last edge of the chain is (p; q) and is — if the last edge is (g; p) and A(q) is 
the amount of additional flow that can be sent from a to g. That is, A(q) is the 
minimum slack among the edges of the chain from a to g. On a backwardly 
directed edge e, the slack is the amount of flow that can be removed, namely 
g(e). A + superscript on g means flow is being added to edge (p; q); a 
— superscript means flow is being subtracted from edge (q; p). 


Augmenting Flow Algorithm 


1. Give vertex a the labels (-, ©). 
2. Call the vertex being scanned p with second label A(p). Initially, p = a. 


(a) Check each incoming edge e = (q; p). If g(e) > 0 and gq is unlabeled, 
then label g with [p~, A(q)], where A(q) = min[A(p), ¢(e)]. 

(b) Check each outgoing edge e = (p; q). If s(e) = ke) — ge) > 0 
and qg is unlabeled, then label g with [p*, A(qg)], where A(q) = 
min[A(p), s(e)]. 

3. If zhas been labeled, go to Step 4. Otherwise choose another labeled vertex 
to be scanned (which was not previously scanned) and go to Step 2. If 
there are no more labeled vertices to scan, let P be the set of labeled 
vertices, and now (P, P) is a saturated a—z cut. Moreover, |g| = k(P, P), 
and thus ¢ is maximal. 

4. Find an a-—z chain K of slack edges by backtracking from z as in the shortest 
path algorithm. Then an a—z flow chain gx along K of A(z) units is the 
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desired augmenting flow. Increase the flow in the edges of K by A(z) units 
(decrease flow if edge is backward directed in K). 


Like the shortest path algorithm, the flow algorithm extends partial-flow 
chains from currently labeled vertices to adjacent unlabeled vertices. Before 
we prove that repeated application of our algorithm always leads to a maximal 
flow, let us give some examples. 


Example 2 (continued) 


Let us apply our augmenting flow algorithm to the flow in Figure 4.4e. 

Vertex a is labeled (—, ©) by Step 1 of the algorithm. Next we apply Step 
2b at a (Step 2a does not apply at a since the definition of a flow does not 
allow flow into the source a). The edge (a; b) is saturated, but (a; c) has slack 
5 — 2 = 3. So we label c (a*, 3). At c, Step 2a finds no incoming flow from 
an unlabeled vertex, but Step 2b finds slack in edge (c; e) going to unlabeled 
vertex e. We label e (c*, 2) [2 is the minimum of A(c), the extra flow we can 
get to c, and the slack in edge (c} e)]. At e, Step 2a finds a positive flow 
entering on edge (b; e) from unlabeled vertex b. We label b (e~, 2). From b, 
we label d (b*, 2), and from d we label z (d*, 2). Sink z is now labeled and 
so Step 4 tells us we can get A(z) = 2 more units of flow from a to z. Backtracking 
with the labels, the flow chain K (in backwards order) is z—d—b—e—c—a. The 
new flow g + 2¢, is shown in Figure 4.4f. = 


Example 3 


Consider the network shown in Figure 4.5a. We seek a maximal flow from a 
to z. 

If a maximal flow were being found by a computer, it would have to start 
with a zero flow. When solving a flow problem by hand, we can speed the 
process by starting with a (nonzero) flow obtained by inspection (in small 
networks we can often obtain a maximal flow by inspection). Let the initial 
flow be ¢ = 4x, + Agx, + 5¢x,, where K, = a—b—e—z, K, = a—c—d-f-z, and 
K, = a—d-f-z. See Figure 4.5a. The reader may see that we sent too much 
flow from c to d. Some of it should have gone directly from c to f, 

We now apply the labeling algorithm to the flow ¢. Label vertex a (-, ~). 
Scanning edges at a, there cannot be incoming edges to the source with flow 
[by part (b) of the definition of a flow], but there are three outgoing edges: 
edge (a, b) has slack s(a; b) = 2 > 0 and b is unlabeled, so we label b (a’, 
2), where 2 is the minimum of A(a) (= ©) and s(a- b); edge (a; c) has no 
Slack; edge (a; d) has slack s(a; d) = 2 and d is unlabeled, so we label it (a*, 
2). 


Next we scan b (we will scan vertices in the order that they were labeled). 
There are two incoming edges at b; edge (a; b) has g(a? b) = 4 > 0 but a is 
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already labeled; edge (c; b) has no flow. There is one outgoing edge at b: edge 
(b> @) has slack s(b; e) = 3 and e¢ is unlabeled, so we label e (b*, 2), where 2 
is min [A(bd), s(bj e))]. 

Next we scan at d. There are two incoming edges at d: edge (a, d) comes 
from a labeled vertex; edge (c; d) has g(c; d) = 4 and c is unlabeled, so using 
Step 2a we label c (d~, 2), where 2 = min[A(d), g(c; d)]. There is one outgoing 
edge at d: edge (df) is saturated. 

No labeling can be done from e. At c we label f with (c*, 2). At f we label 
z with (f*, 2). 

Since z is now labeled, the labeling procedure terminates. We can send 2 
[= A(z)] units in the augmenting a—z flow chain ¢x,, where K, = a—d-c-f—z, 
found by the backtracking procedure. Recall that since edge (c;, d) is backwardly 
directed in K,, the flow chain ¢ , subtracts 2 units from the current flow in 
edge (c; d). The new flow gy’ = g + 29x, is shown in Figure 4.5. 

We eliminate all the current labels and restart the labeling algorithm from 
scratch. Label vertex a (-, ©). From a we label b with (a*, 2). At b we label 
e with (b*, 2). At e we label c with (e*, 2). At c we label d with (c*, 2) and f 
with (c*, 1). At d we can make no new labels. At f we label z with (f*, 1). 
The augmenting a—z flow chain is ¢«,, with Ks = a-b—e—c—f-z, using forward 
edge (e; c). Our new flow is ge" = pt 2gx, + Pky shown in Figure 4.5c. 

The reader may have observed that the flow ¢” is maximal. The incoming 
edges at z are now saturated. Let us again apply the augmenting flow algorithm 
to flow g”. At a we label b; at b we label e; at e we label c; and at c we label 
d. No more vertices can be labeled. Let P be the set of labeled vertices. Then 
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(P, P) is the saturated a—z cut specified by the algorithm with |g’| = 16 = 
k(P, P). = 


In applying our algorithm, we must always check incoming edges in Step 
2a for the possibility of minus labeling of vertices, even though this labeling 
is very infrequent. Recall that the minus labeling corresponds to correcting a 
mistaken flow assignment. A permissible shortcut would be to use only positive 
labeling (as in the faulty procedure discussed earlier) until no new flow paths 
can be found, and then apply the full algorithm to hunt for ‘‘mistakes.’’ The 
use of this shortcut serves to increase the importance of having a rigorous proof 
that, when repeatedly applied to any given flow, our algorithm will yield a 
maximal flow. 


Theorem 3 


For any given a—z flow g, a finite number of applications of the augmenting 
flow algorithm yields a maximal flow. Moreover, if P is the set of vertices 
labeled during the final (unsuccessful) application of the algorithm, then 
(P, P) is a minimal a—z cut set. 


Proof 


There are two main parts to the proof. First, if ~ is the current flow and ¢, is 
the augmenting a—z unit flow chain (along chain K) found by the algorithm 
with m = A(z), then we must show that the new flow o + mg, is indeed a 
legal flow in the network. Both and g, satisfy flow conditions (b) and (c) 
and are integer-valued. Hence ~ + mg, also satisfies (b) and (c) and is integer- 
valued. The labeling algorithm is designed so that m = A(z) is the minimum 
slack (of the appropriate kind) along chain K and hence ~ + mg, satisfies flow 
condition (a): 0 = g(e) + me;(e) = ke). 

Since m is a positive integer, each new flow is larger by an integral amount. 
The capacities and the number of edges are finite, and so the algorithm must 
eventually halt—fail to label z. Let P be the set of labeled vertices when the 
algorithm halts. Clearly (P, P) is an a—z cut, since a is labeled and z is not. 
Observe that there cannot be an unsaturated edge from a labeled vertex p to 
an unlabeled vertex q, or else at p we could have labeled q in Step 2b. Similarly, 
there cannot be a flow in an edge from an unlabeled vertex g to a labeled vertex 
Pp, or else again at p we could have labeled q in Step 2a. Thus, both conditions 
of Corollary 2a are satisfied. Hence the value of the final flow equals k(P, P) 
and is maximal. Also (P, P) is a minimal a—z cut. # 


Corollary 3a Max Flow-—Min Cut Theorem 


In any directed flow network, the value of a maximal a—z flow is equal to the 
capacity of a minimal a—z cut. 


Let us now indicate how flows in a directed network can be used to model 
a large variety of extensions in directed and undirected networks. 
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Example 4 Undirected Networks 


Suppose the undirected network in Figure 4.6 represents a network of telephone 
lines (the capacity of an edge is the number of calls the line can handle). We 
wish to know the maximal number of calls that the network can simultaneously 
carry between locations a and z. That is, we seek the value of a maximal flow 
in this network. 

To make the network directed, we can replace each undirected edge (x, y) 
by the two edges (x; y) and (y; x), each with the same capacity as (x, y). An 
equivalent approach is to allow a directed flow in undirected edges. If e = 
(x, y), p(e) would be a number with an ‘‘arrow’’ indicating whether the flow 
goes from x to y or from y to x. Step 2 of the flow algorithm is modified as 
follows: When checking edges at a labeled vertex p, edges with a flow directed 
inward are treated like incoming edges and edges with no flow or flow away 
from p are treated like outgoing edges. m= 


Example 5 Flow Networks with Supplies and Demands 


Consider the network of solid edges in Figure 4.7 with supplies (capacitated 
sources) and demands (capacitated sinks). Vertex b can supply up to 60 units 
of flow, and vertices c and d can each supply 40 units. Vertices h, i, and j have 
flow demands of 50, 40, and 40 units, respectively. Can we meet all the 
demands? The sources could be oil refineries and the sinks oil-truck distribution 
centers, or the sources factories and the sinks warehouses. 

We model this network problem with a standard one-source, one-sink 
network as follows. Make b, c, and d regular nonsource vertices. Add a new 
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source vertex a and edges (a; b), (a; c), and (a; d) having capacities 60, 40, 
and 40, respectively. This construction simulates the role of the capacitated 
sources b, c, d. Next make h, i, and j regular nonsink vertices and add a new 
sink vertex z and edges (h; z), (i; z), and (j; z), having capacities 50, 40, and 
40, respectively. 

A flow satisfying the original sink demands is equivalent to a flow in the 
new network that saturates the edges coming into z, that is, a flow of value 
130. Such a flow exists if no a—z cut has capacity less than 130. = 


Example 6 Edge-Disjoint Paths in a Graph 


We are going to send messengers from a to z in the graph shown in Figure 
4.8. Because certain edges (roads) may be blocked, we require each messenger 
to use different edges. How many messengers can be sent? That is, we want 
to know the number of edge-disjoint paths. 

We convert this path problem into a network flow problem by assigning 
unit capacities to each edge. One could think of the flow as ‘‘flow messengers,”’ 
and the unit capacities mean that at most one messenger can use any edge. The 
number of edge-disjoint paths (number of messengers) is thus equal to the 
value of a maximal flow in this undirected network. (See Example 4 for flows 
in undirected networks.) 

Observe that we have implicitly shown that a maximal a—z flow problem 
for a unit-capacity network is equivalent to finding the maximal number of 
edge-disjoint a—z paths in the associated graph (where edge capacities are 
ignored). This equivalence can be extended using multigraphs to all networks 
by replacing each k-capacity edge (x, y) with k multiple unit-capacity edges 
and now proceeding with the above conversion. = 


Example 7 Dynamic Network Flows 


We want to know how many autos can be shipped from location a to location 
z in four days through the network in Figure 4.9a. We assume that each edge 
(x; y) is the route of a train that leaves location x daily for a nonstop run to 
location y. The first number associated with an edge is the capacity of the trains 
(number of autos). The second number is the number of days the trip takes. 
Autos may be left temporarily at any location in the network. 

We turn this dynamic problem into a static a—z flow problem by adding 
the dimension of time to our network: Each vertex x is replaced by five vertices 


Figure 4.8 
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Xoy X1, X29 X35 X4, Where the subscript refers to the ith day in the four-day shipping 
time (the starting day is day 0). For each original edge (x, y), which takes k 
days to traverse, we make edges (Xo, Yu) (1s Yast)» «++» Gs-e> Ys), each with 
the same capacity as (x; y). For each vertex x, we make four edges of the form 
(x; xi41) with very large (in effect, infinite) capacity; these edges correspond 
to the provision that permits autos to be left temporarily at any vertex from 
one day to the next. See Figure 4.9b. 

A maximal flow in the new network gives the maximal dynamic flow in 
the original network. Note that for vertices other than a and z, the range of the 
subscripts of usable vertices will be at most 1 to 3. Nondaily trains could easily 
be incorporated into the model. = 


It is hoped that the preceding four examples have impressed the reader 
with the versatility of our basic static a—z flow model. More examples are to 
be found in the exercises. 


4.2 EXERCISES 


Mo Summary of Exercises Exercises 1-17 mimic or extend the 


flow computations and models in Examples 3-7. Exercises 18-38 develop the 
theory of network flows (later exercises are very challenging). Exercises 39-42 
present programming projects. 
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In the following problems, unless directed otherwise, the reader should 


route most of the flow by inspection and then, when a near-optimal flow is 
obtained, use the augmenting flow algorithm to get further flow and afterwards 
a minimal cut. 


1. Apply the augmenting flow algorithm to the flow in Figure 4.4. 
2. Find a maximal a—z flow and minimal capacity a—z cut in the following 


3. 


4. 


networks: 
(a) 
12 10 
a Zz 
10 15 


Find a maximal flow from a to z in the network of Figure 4.6 (using the 
associated directed network). Also give a minimal capacity a—z cut. 


Treat the first number assigned to each edge in Exercise 3 of Section 4.1 
as a capacity and let the edges be directed by the alphabetical order of 
their endpoints with L preceding a and W following n [e.g., edge (f, /) 
goes from f to i]. 


(a) Find a maximal L—W flow and a minimal L—W cut in this network. 
(b) By inspection, find a different maximal L—W flow. 


(c) Make a misrouted flow that yields a saturated L—W cut whose capacity 
is greater than the flow (similar to the situation in Figure 4.4e). Now 
apply the algorithm. 


(d) In addition, let the second number of each edge be a lower bound 
on the flow in that edge. Try to find an L—W flow satisfying both 
constraints. 


. Delete vertices p through y in Figure 3.12 in Section 3.3 and treat the 


remaining edge numbers as capacities (edges are still undirected). Use the 
undirected version of the flow algorithm suggested in Example 4. 
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11. 


12. 


13. 


14. 


(a) Find a maximal flow from b to j and a minimal b—j cut. 

(b) Build a b-/ flow that saturates edge (&, /) (in either direction) and then 
apply the algorithm. 

(c) Treat the edge numbers as lower bounds and modify the algorithm to 
find a minimal b—j flow, using the whole network (a through y). 
Remember that there is no flow into b or out of j. 


. Is there a flow meeting the demands in Figure 4.7? 
. Suppose vertices b, c, d in Figure 4.7 have unlimited supplies. How much 


flow can be sent to the set {A, i, j}? Explain your model. 


. Vertices b, c, d have supplies 30, 20, 10, respectively, and vertices j, k 


have demands of 30 and 25 in this network. 


(a) Find a flow satisfying the demands, if possible. 
(b) Reverse the direction of edge (h, g) and repeat part (a). 


. Solve the messenger problem in Example 6. 
10. 


Suppose that up to three messengers can use each edge in Example 6. Now 

how many messengers can be sent? Is the answer with such a modification 

always just three times the answer to the original problem? 

(a) Ignoring the numbers of the edges in Exercise 3 of Section 4.1, what 
is the size of the largest set of edge-disjoint paths from L to W? 

(b) What is the size of the largest set of paths from L to W such that no 
edge is used by more than five paths? 


Suppose that no more than five units of flow can go through each intermedi- 
ate vertex b, c, d, in Figure 4.5a. Now find a maximal flow in this revised 
network and associated minimal a—z cut. 

Suppose that no more than 20 units of flow can go through each intermediate 
vertex b, c, d, e, f, g in Figure 4.6. Now find the maximal flow in this 
revised network. 

What is the size of a largest set of vertex-disjoint paths from a to z in 
Figure 4.8? 
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. Solve the dynamic flow problem in Example 7. 
. In Example 7, suppose that the trains do not run every day. Let trains 


depart from a and c on Monday, Wednesday, and Friday, and from b and 
d on Tuesday, Thursday, and Saturday. In one week, Monday through 
Sunday, how many cars can be sent from a to z in that network? 


. Suppose that it takes one day to traverse each edge in Figure 4.2. How 


many units can be moved from a to z in five days in this network? 


. In the proof of Theorem 3, show that ¢° = m@x satisfies the second part 


of condition (b). 


. (a) Prove that if a directed network contains edges (x; y) and (y; x) for 


some x, y, then the augmenting flow algorithm would never make 
assignments that would have flow occurring simultaneously in both 
edges. 


(b) Could edges (x; y) and (y; x) both get flow if the augmenting flow 
algorithm in Step 2 checked outgoing edges before incoming edges? 

(a) Restate the augmenting flow algorithm so that the labeling starts at z 
and ‘‘works back’’ to a. 


(b) Restate the augmenting flow algorithm for undirected networks (see 
Example 4). Sketch a proof of this algorithm. 


. Show that the set of edges used to label vertices in Steps 2a and 2b of the 


augmenting flow algorithm form a tree rooted at a. 


. (a) Give a weakened replacement for condition (b) in the definition of an 


a-z flow. The new condition should still ensure that the net flow is 
from a to z. 

(b) Suppose condition (b) is eliminated. Can there exist maximal flows 
that violate condition (b)? Prove or give a counterexample. 


. Build a flow in the network in Figure 4.5a with the prescribed properties: 


(a) Its value |¢g| is 0 but not all edges have 0 flow. 


(b) Its value is 2 but it cannot be decomposed into a sum of a~z flow 
paths. 


. (a) Show that for a flow in a (directed or undirected) network, if ¢ is 


circuit-free, that is, there is no set of edges with flow that form a 
(directed) circuit, then ¢@ can be decomposed into a sum of a—z flow 
paths. (Hint: Prove by induction on the value of ¢.) 

(b) Use the proof to get a decomposition algorithm for any such ¢. 


(c) Conclude that any flow ¢ can be decomposed into |g| a—z flow paths 
plus a set of circuits. 


. (a) Prove that starting from a circuit-free flow (see Exercise 24), perhaps 


a zero flow, the maximal flow generated by the augmenting flow 
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26. 


27. 


algorithm is circuit-free and hence [by Exercise 24(b)] can be decom- 
posed into a sum of a—z flow paths. 

(b) Use part (a) and Exercise 24(b) to find the routings of a maximal set 
of phone calls in Example 4. 


How many times is an edge checked in Step 2 to perform one complete 
iteration of the augmenting flow algorithm (resulting in increased flow or 
a min-cut)? 

Show that if a flow is decomposed into unit-flow paths and if each unit- 
flow path crosses a given saturated cut once, then the flow is maximal. 


28. A cut-set in an undirected graph G is a set S of edges whose removal 


29. 


30 


31. 


32. 


33. 


disconnects G but no proper subset of S disconnects G. Prove that in an 
undirected flow network, every cut-set that separates a and z is an a—z cut 
and every minimal a—z cut is a cut-set. 


Let G be a connected, undirected graph and a, b be any two vertices in G. 


(a) Show that there are k edge-disjoint paths between a and 5 if and only 
if every a—b cut has at least k edges. 


(b) Show that there are k vertex-disjoint paths between a and b if and only 
if every set of vertices disconnecting a from b has at least k vertices. 


As mentioned in Example 6, we can model a flow network N (directed or 
undirected) by another multigraph flow network N’ in which each edge 
has unit capacity; N’ has the same vertices as N and for each edge e in N 
there are k(e) edges in N’ paralleling e. Since a flow in N’ takes 0 or 1 
values on the edges, we can drop the capacities in N’ to get a multigraph 
G'. A flow in N’ is just a subset of edges (with flow) in G’. 


(a) Characterize the subsets of edges in G’ that correspond to an a—z flow 
in N’. 

(b) Restate the augmenting flow algorithm in terms of G’. 

(c) Using the multigraph model, prove that any a—z flow ¢ in N’ contains 
|~| a—z flow paths. 


(d) Using the multigraph model and assuming the result in Exercise 29(a), 
prove Corollary 3a (max flow—min cut theorem). 


Suppose the numbers on the edges in a directed network represent lower 
bounds for the flow. State a decreasing flow algorithm. Sketch a proof of 
this algorithm and deduce the counterpart of Corollary 3a. 

(a) Explain how the algorithm in Exercise 31 can be applied to a bipartite 
graph to find a minimal set of edges incident with every vertex in the 
graph. 

(b) Find such an edge set for the bipartite graph in Figure 4.10 of Section 
4.3. 


Suppose we have upper and lower bounds k,(e) and k,(e), respectively, on 


34. 


35. 


36. 


37. 
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the flow in each edge e in a directed network and that we are given a 
feasible flow (satisfying these constraints). 


(a) Modify the augmenting flow algorithm so that it can be used to construct 
a maximal flow from a given feasible flow. 


(b) Prove that the maximal flow has value equal to the minimum of 
k, (P, P)_ — k,P, P), among all a—-z cuts (P, P), where k,(S, S) 
and k,(S, Ss ) are the sums of the upper and lower bounds of edges in 
a cut (S, S). 


Consider a directed network with supplies and demands as in Example 5. 
Let z(P) be the total demand of vertices in set P and a(P) be the total 
supply of vertices in P. 


(a) Prove that the demands can be met if and only if for all sets P, 
2(P) — a(P) = K(P, P). (Hint: Generalize the reasoning in Example 5.) 

(b) Prove that the supplies can all be used if and only if for all P, 
a(P) — 2(P) = K(P, P). 


Suppose that the edges in Figure 4.9a in Example 7 were undirected. How 
would we construct a static flow model to simulate this dynamic flow 
problem so that a maximal static flow in the new network would correspond 
to a maximal dynamic flow? Are there any difficulties? 


Suppose an undirected flow network is a planar graph and a (at the left 
side) and z (at the right side) are both on the unbounded region surrounding 
the network. Draw edges extending infinitely to the left from a and to the 
right from z; give them infinite capacity. This divides the unbounded region 
into two unbounded regions, an upper and a lower unbounded region. Now 
form the dual network (see Section 1.4) of this planar network with each 
dual edge assigned the capacity of the original edge it crosses. 


(a) Show that a path from the upper unbounded region’s vertex to the 
lower unbounded region’s vertex in the dual network corresponds to 
an a—z cut in the original network. Thus, a shortest such path in the 
dual network is a minimal a-z cut in the original network. 


(b) Draw the dual network for the network in Figure 4.6 and find a shortest 
path (using the algorithm in Section 4.1) corresponding to a minimal 
cut in the original network. 


Suppose an undirected flow network N is a planar graph and a (at the left 

side) and z (at the right side) are both on the unbounded region surrounding 

the network. Consider the following flow-path building heuristic. Starting 

from a, build an a—z flow path by choosing at each vertex x the first unsatu- 

rated edge in clockwise order starting from the edge used to enter x. 

(a) Apply this heuristic to the network in Figure 4.8 to find a maximal 
flow. 


(b) Show that repeated use of this heuristic yields a maximal flow in N. 
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38. Prove that repeated use of the augmenting flow algorithm yields a maximal 
flow in a finite number of applications for networks with irrational capacities 
provided that vertices are ordered (indexed) and that the next vertex scanned 
in Step 2 of the algorithm is the labeled vertex with lowest index. 


39. Write a program to find maximal flows in directed networks (the networks 
are input data). 


40. Write a program to find the maximal number of paths in an undirected 
graph between two given vertices such that: 


(a) The paths are edge disjoint. 
(b) The paths are vertex disjoint. 


41. Write a program that, when given a network with an a—z flow of value k, 
will extract from the flow k unit-flow paths from a to z. 


42. Write a program to find, for a given pair of vertices a and b in a given 
connected graph, a minimal set of vertices whose removal disconnects a 
from b (see Exercise 29). 


4.33 ALGORITHMIC MATCHING 


In this section, we apply network flows to the theory of matchings. Recall that 
a bipartite graph G = (X, Y, E) is an undirected graph with two specified 
vertex sets X and Y and with all edges of the form (x, y), x © X, y € Y. See 
Figure 4.10. Bipartite graphs are a natural model for matching problems. We 
let X and Y be the two sets to be matched and edges (x, y) represent pairs of 
elements that may be matched together. 

A matching in a bipartite graph is a set of independent edges (with no 
common endpoints). The thicker edges in Figure 4.10 constitute a matching. 
An X-matching is a matching involving all vertices in X. A maximal matching 
is a matching of the largest possible size. As with network flows, we cannot 
always obtain a maximal matching in a bipartite graph by simply adding more 
edges to a nonmaximal matching. The matching indicated in Figure 4.10 cannot 
be so increased, even though it is not maximal. 

A typical matching problem involves pairing off compatible boys and girls 
at a dance or the one-to-one assignment of workers to jobs for which they are 
trained. A closely related problem is to find a set of distinct representatives 
for a collection of subsets. We need to pick one element from each subset 
without using any element twice. In the bipartite graph model, we make one 
X-vertex for each subset, one Y-vertex for each element, and an edge (x, y) 
whenever element y is in subset x. Now an X-matching picks a distinct represen- 
tative element for each subset. Conversely, any matching problem can be 
modeled as a set-of-distinct-representatives problem. 
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b 8 
¢ A 
d i 
e j 
f k 
Figure 4.10 m 


We employ a modification of the trick in Example 5 of Section 4.2 to turn 
a matching problem into a network flow problem. Associate a supply of 1 at 
each X-vertex and a demand of 1 at each Y-vertex. The capacities of the edges 
from X to Y can be any large positive integers, but it is convenient to assume 
that these capacities are ©. We also assume that edges are directed from X to 
Y. Now we apply the technique in Example 5 of Section 4.2 with source a 
connected by a unit-capacity edge to each X-vertex and sink z connected by a 
unit capacity edge from each Y-vertex. 

We call such a network a matching network. See Figure 4.11. The X-Y 
edges used in an a—z flow constitute a matching. A maximal flow is a maximal 
matching. A flow saturating all edges from source a corresponds to an X- 
matching. 


Example 1 


Suppose the bipartite graph in Figure 4.10 represents possible pairings of boys 
b, c, d, e, f with girls g, h, i, j, m. A tentative matching indicated by thicker 
edges in Figure 4.10 was made. Although this matching cannot be increased, 
we still wonder whether a complete X-matching is possible. 

As has happened before in flow problems, we have made a ‘‘mistake’’ in 
this matching and now need to make some reassignments. We convert this 
matching into the corresponding flow in the associated matching network, and 
we apply the augmenting flow algorithm. See Figure 4.11. 

From a, the only outgoing edge that is not saturated is (aj c), since c is 
the only X-vertex not involved in the initial matching. We label c (a*, 1). From 
c we can label the two Y-vertices, g and i, adjacent to c with the label (c*, 1). 
At g the one outgoing edge—to z—is saturated. Then there must be an incoming 


Figure 4.11 
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edge to g with positive flow, namely (5; g), coming from the unlabeled X- 
vertex b. We label b (g~, 1). By similar reasoning, from i we label f (7, 1). 
At b and f we can label Y-vertices not currently matched to b and f. We label 
j (b*, 1) and m (f*, 1). 

Summarizing our labeling thus far, we are trying to find a match for c (the 
only X-vertex currently unmatched). From c we have possible matches with g 
and i, but to make either match, we must sever the current match of g with b 
or of i with f and rematch b or f with other Y-vertices—in this network, b can 
be rematched with j or f rematched with m. Continuing this reasoning, to 
rematch b with j requires us to sever j’s current match with e and find a new 
match for e. Or to rematch f with m requires us to sever m’s current match 
with d and find a new match for d. Now either d can be matched with h 
(currently unmatched) or e can be matched with k (also currently unmatched). 
The fact that A and k are unmatched is reflected in the fact that from either h 
or k one can label z. See the labels in Figure 4.11. 

If we label z (k*, 1), the augmenting flow chain prescribed by the algorithm 
is a~c-g—b—j—e—k—z. Our augmenting flow chain specifies that we add edge 
(c? g) to our matching, remove edge (b; g), add edge (b; j), remove edge (e; 
j), and add edge (e; k). The new flow corresponds to the matching b—j, c—g, 
d—m, e—k, f-i. Another application of the algorithm would label only a. The 
set of edges leaving a now forms a (saturated) minimal a—z cut—a sign that 
we have an X-matching. = 


Observe that the action of the augmenting flow algorithm in matching 
problems can be described as follows: Starting from an unmatched vertex x 
in X, we go on anonmatching edge (an edge not currently used in the matching) 
to a matched Y-vertex y,; then we move back from y, along a matching edge 
to a matched X-vertex x,; then we go on a nonmatching edge from x to a 
matched vertex y,; and so on until a nonmatching edge (%, y,) goes from a 
matched X-vertex x; to an unmatched Y-vertex y,. In short, starting from an 
unmatched X-vertex, we create an odd-length alternating path L of nonmatching 
and matching edges in search of an unmatched Y-vertex. Given such a path, 
we get a new, larger set of matching edges by interchanging the roles of 
matching and nonmatching edges on L. 

We have seen that in bipartite graphs, a matching is analogous to a flow 
in a network. What then is the bipartite graph counterpart to an a—z cut? 
Actually it is convenient to restrict our attention to finite-capacity a—z cuts. 
The corresponding concept is an edge cover, a set S of vertices such that every 
edge has a vertex of S as an endpoint. Recall that edge covers were encountered 
in a street surveillance problem in Section 1.1. 


Lemma 

Let G = (X, Y, E) be a bipartite graph and let N be the matching network 
associated with G. For any subsets (possibly empty) A C X and B C Y, S= 
A U B is an edge cover if and only if (P, P)) is a finite capacity a—z cut in N, 
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where P = a U (X — A) UB. Interms of P, S = (PMX) U(P MY). Further, 
|S| = k(P, P). 


Proof 


A finite capacity a—z cut cannot contain an edge between X and Y (whose 
capacity is ©). Then a possible finite-capacity cut (P, P) must consist of edges 
of the form (a; x), x © A and (y; z), y € B for some sets A C X andB CY. 
These edges block all flow (and thus are an a—z cut) if and only if any X-Y 
edge starts at some x € A or ends at some y € B, that is, if and only if § = 
A U Bis an edge cover. In terms of A and B, P = a U (X — A) UB, and in 
terms of P, A = P M X and B = Pf Y. See Figure 4.12. Also, |S| = 
K(P, P), since the edges in (P, P) all have unit capacity. 


We now prove two famous theorems about matchings in bipartite graphs. 


Theorem 1 


In a bipartite graph G = (X, Y, E), the size of a maximal matching equals the 
size of a minimal edge cover. 


Matchings correspond to flows in the associated matching network and, 
by the lemma, edge covers correspond to (finite capacity) a—z cuts. Theorem 
1 is simply a bipartite-graph restatement of the max flow—min cut theorem 
(Corollary 3a in Section 4.2). 

In the next theorem, the range R(A) of A denotes the set of vertices adjacent 
to a vertex in A. 


Theorem 2 (Hall’s Marriage Theorem) 


A bipartite graph G = (X, Y, E) has an X-matching if and only if for each 
subset A C X, |R(A)| = |A 


Proof 


The condition is necessary, for if |R(A)| < |A], a matching of the vertices in A 
is clearly impossible, and so an X-matching is impossible. 

Next we show that if |R(A)| = |A|, for all A C X, then G has a X-matching. 
By Theorem 1, an X-matching exists if and only if each edge cover S has size 


Figure 4.12 
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|S| = |x|. If S contains only X-vertices—that is, S = X—then the result is 
immediate. We must show that S cannot be made smaller by dropping some 
X-vertices and in their place using a smaller number of Y-vertices. If A is the 
set of X-vertices not in S, then SNY must contain the vertices in R(A) to cover 
the edges between A and R(A). Then we have 


|S| = |SAX| + |SAY| = |x| — [A] + ISNY| 
= |x| — |A| + |R(A)| 


Since |R(A)| = |Al, then |X| — |A| + |R(A)| = |X|, and so |S| = |X], proving 
that there is a matching of size X. 


Theorems 1 and 2 are the starting point for a large family of matching 
theorems (for example, see Exercises 14 and 19). 


Example 2 


Country A would like to spy on all meetings in its territory between diplomats 
from Country X and from Country Y. We know which pairs of diplomats are 
likely to meet. We can make a bipartite graph expressing this likely-to-meet 
relationship. Country A cannot afford to assign spies to every X diplomat or 
to every Y diplomat. Instead it wants to find a minimal set S of diplomats such 
that every possible meeting would involve a diplomat of S. 

Country A hires a graduate of this course, who immediately sees that this 
minimal spying problem is really a minimal edge cover problem in disguise. 
The problem can thus be solved by using the augmenting flow algorithm to 
find a minimal a—z cut in the associate matching network and, from the a—z 
cut, obtain a minimal edge cover using the lemma. s 


Example 3 


Suppose there are n people and n jobs, each person is qualified for k jobs, and 
for each job there are k qualified people. Is it possible to assign each person 
to a (different) job they can do? 

We model this problem using a bipartite graph G = (X, Y, E) with X- 
vertices for people and Y-vertices for jobs. Note that each vertex will have 
degree k. The question is now, Is there an X-matching? From Theorem 2, we 
can deduce a yes answer as follows: Let A be any subset of X. Since each 
vertex has degree k, there will be k|A| edges leaving A. Since at most k of these 
edges can go to any one vertex in Y, it follows that R(A) has at least |A| vertices. 
Then by Theorem 2, there is an X-matching. = 


We close this chapter with an application of matching networks to the 
sports problem of determining late in a season whether a particular team still 
has a mathematical chance of being conference champion. 
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Wins Games with with with with 
Team to date toplay Bears Lions Tigers Vampires 
Bears (t,) 
Lions (¢2) 


Tigers (t3) 
Vampires (t,) 


Figure 4.13 


Example 4 


If we know how many games each of the teams in a sports conference has 
won to date and how many games remain to be played between each pair of 
teams, then we want to know if there exists a scenario under which a specified 
team could finish the season with the most wins in the conference. 

Let us consider a concrete example with four teams, the Bears, the Lions, 
the Tigers, and the Vampires with the following records of wins and games to 
play. Suppose we wonder if it is possible for the Bears (currently with fewest 
wins) to end the season with the most wins. 

We describe how to construct a matching network model to answer the 
question, Could team ¢* end up as conference champion if t* wins all remaining 
games? Figure 4.14 gives the network for the data in Figure 4.13 when f* is 
the Bears. The X-vertices will represent the other teams besides t*. In Figure 
4.14 the X-vertices will be L (for Lions), T (for Tigers), and V (for Vampires). 
The Y-vertices will represent all different pairs (t;, t;) of other teams. In this 
case, the Y-vertices will be LT, LV, and TV. 

The size of the flow in the edge from source a to X-vertex ¢; will stand for 
the number of games won by team ¢; in the rest of the season. Let W* be the 
number of wins at the end of the season for team ¢* if it wins all its remaining 
games (for the Bears, W* is 23). The capacity of the edge (a; ¢;) will be 
W* — w;, where w; denotes the number of wins to date by ¢,. Observe that if 
team ¢, has won w; games to date, and if it wins at most W* — w; games in 
the rest of the season, then ¢; cannot win more than W* games, enabling t* to 
be the champion, or possibly co-champion (assuming ¢* wins all remaining 
games). 


Figure 4.14 (middle edges have capacity ~) 
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The flow in the X—Y edge (¢;; (¢;, t;)) will represent the number of wins of 
t; over t; in the remaining games. The capacity of all X—-Y edges is ©. The flow 
in the edge from the Y-vertex (¢;, 4) to the sink z represents the number of 
games that will be played between teams /; and ¢;. The capacity of (4, tj); 2) 
is the actual number of scheduled remaining games between ¢; and t;. Conserva- 
tion of flow at vertex ¢; requires that flow in (the number of games won in the 
rest of the season) = flow out (the number of future wins over various other 
teams). Conservation of flow at the vertex (4, t)) requires that flow in (the wins 
of t; over ¢, and of t; over t;) = flow out (the number of games these two teams 
will play). 

To have a flow that represents a possible real scenario, we require that the 
edges from Y to z all be saturated—the number of games played between 1; 
and 4; is the actual number of games scheduled between them. If a flow exists 
in this network that saturates the Y—z edges, it will specify how many wins 
each team, other than t*, could have against other teams so that no team ends 
up with more wins than ¢*. That is, it produces a scenario in which ¢* ends 
the season with the most wins. Figure 4.14 presents a flow saturating the Y—z 
edges showing how the Bears could end up with the most wins (actually tying 
in this case with the Tigers). = 


We now extend the preceding sports network model to give a necessary 
and sufficient condition for a particular team to be able to be conference 
champion. For a subset S of teams, let w(A) denote the total number of wins 
to date by all teams in S and let r(S) be the total number of games remaining 
to be played between all pairs of teams in S. Observe that for any subset S of 
teams, the total number of wins at the end of the season by teams in S will be 
at least w(S) + r(S) (because when two teams in S play each other, one of 
them must win). One constraint that guarantees that ¢* cannot finish with the 
most wins is if the average number of wins by any subset S not containing ¢* 
exceeds W*, that is, 


for any subset S not containing t*, {w(S) + r(S)}/|S| > W* (*) 


Recall that W* is the final number of wins by ¢* if ¢* wins all remaining 
games. 

We shall show that constraint (*) is sufficient as well as necessary to 
characterize when team t* is eliminated. If the min-cut is the set of Y—z edges, 
then there is a flow saturating all Y-z edges and specifying a scenario in which 
t* has the most wins (as described in the example above). Suppose there is an 
a—z cut (P, P) of smaller capacity. We now show that (*) is satisfied by some 
subset S. 

(P, P) cannot contain any of the infinite-capacity edges from X to Y, and 
so it consists solely of a—X edges and Y—z edges. Recall that edge (a, t;) has 
capacity W* — w;. Then if A = P 1X, the a—X edges in (P, P) have capacity 
|A|w* — w(A). One can show that if ¢; and 4, are in P, then (4, t;) must be in 
P (details are left as an exercise). Then the Y—z edges in (P, P) have capacity 
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Du, Ape Vis where r; denotes the capacity of edge ((#, t;); z) (= the number of 
games remaining to be played between 4 and 4) and we sum over all (4, t)) 
pairs for which at least one of ¢; or ¢; is not in A. So k(P, P) = [|A|W* — w(A)] 
+ Yuea My: 

Assuming k(P, P) is less than rj, the capacity of all the edges from Y to 
z, then 


|AIW*—w(A)+ > ry <r; 
(ft, )EA 


=> |A|W* _ w(A) eh a — > Vij => S rij — r(A) 
(ti) 


(af)CA 
=> |A|W* < w(A) + r(A) > W* < {w(A) + r(A)}/|A| 


This last inequality is (*), as claimed, with the subset being A. The teams in 
A average more wins than t*, and hence t* cannot be conference champion. 


4.3 EXERCISES 


V/, Summary of Exercises Exercises 1—9 involve matching com- 


putations. Exercises 10—20 develop theory. 


1. Bill is liked by Ann, Diana, and Lolita; Fred is liked by Bobbie, Carol, 
and Lolita; George is liked by Ann, Bobbie, and Lolita; John is liked by 
Carol and Lolita; and Larry is liked by Diana and Lolita. We want to pair 
each girl with a boy she likes. (Make girls the vertices on the left side.) 


(a) Set up the associated matching network and maximize its flow to solve 
this problem. 


(b) Make a flow corresponding to a partial pairing that has Bill with Diana 
and Fred with Carol along with two other matches (chosen by the 
reader). Now apply the flow augmenting algorithm to increase the 
matching to a complete matching. 


2. Suppose there are five committees: committee A’s members are a, c, e; 
committee B’s members are b, c; committee C’s members are a, b, d; 
committee D’s members are d, e, f; and committee E’s members are e, f. 
We wish to have each committee send a different representative to a 
convention. 


(a) Set up the associated matching network and maximize its flow to solve 
this problem. 

(b) Make a flow corresponding to the partial assignment: A sends e, B 
sends b, C sends a, and D sends f. Now apply the flow-augmenting 
algorithm to increase the matching. 


3. Let us repeat Exercise 1, but this time Bill gets a total of 5 dates, Fred 4 
dates, George 3, John 5, and Larry 3, while Ann gets 4, Bobbie 3, Diana 
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10. 


5, Carol 4, and Lolita 4. Compatible pairs may have any number of dates 
together. Model with a network to find a possible set of pairings. 


. Let us repeat Exercise 1, but this time we want to pair each boy twice 


(with two different girls) and each girl twice. Find the pairings using an 
appropriate network flow model. 


. Suppose that there are 6 universities and each will produce 5 mathematics 


Ph.D.s this year, and there are 5 colleges that will be hiring 7, 7, 6, 6,5 
math Ph.D.s, respectively. No college will hire more than one Ph.D. from 
any given university. Will all the Ph.D.s get a job? Explain. 


. In Example 4, is it possible for the Vampires to be champions (or co- 


champions) if they win all remaining games? Build the appropriate network 
model. 


. In the following table of remaining games, is it possible for the Bears to 


be champions (or co-champions) if they win all remaining games? Build 
the appropriate network model. 


Wins Games with with with with 
Team to date to play Bears Lions Tigers Vampires 
Bears 26 14 — 4 4 6 
Lions 34 5 4 — 0 1 
Tigers 32 8 4 0 — 4 
Vampires 29 11 6 1 4 — 


. In the following table of remaining games, is it possible for the Bears to 


be champions (or co-champions) if they win all remaining games? Build 
the appropriate network model. 


Wins Games with with with with 
Team to date to play Bears Lions Tigers Vampires 
Bears 20 8 — 1 2 5 
Lions 25 3 1 — 1 1 
Tigers 23 5 2 1 — 2 
Vampires 20 8 5 1 2 — 


. There are n boys and n girls in a computer dating service. The computer 


has made nm pairings so that each boy dates m different girls and each 

girl dates m different boys (m<n). 

(a) Show that it is always possible to schedule the nm dates over m nights, 
that is, the pairings may be partitioned into m sets of complete pairings. 

(b) Show that in part (a), no matter how the first k complete pairings are 
selected (0<k<m), the partition can always be completed. 


We want to construct an n X m matrix whose entries will be nonnegative 
integers such that the sum of the entries in row i is r;, and the sum of the 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
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entries in column j is c;. Clearly the sum of the r;s must equal the sum of 
the cjs. 


(a) What other constraints (if any) should be imposed on the r,s and c;s 
to assure such a matrix exists? 


(b) Construct such a 5 X 6 matrix with row sums 20, 40, 10, 13, 25 and 
column sums all equal to 18. 


Give necessary and sufficient conditions for the existence of a circuit or a 
set of vertex-disjoint circuits that pass through each vertex once in a directed 
graph G = (V, E). [Hint: Make a bipartite graph G’ = (X, Y, E) with X 
and Y copies of V, and for each edge (v,, U2) in G, G’ has an edge (x, 
yz); restate the problem. ] 


Prove that a bipartite graph G = (X, Y, EF) has a matching of size ¢ if and 
only if for all A in X, |R(A)| = |A] + ¢ — |X| = ¢ — |X — Al. (Hint: Add 
|X| — t new vertices to Y and join each new Y-vertex to each X-vertex.) 
Show that every bipartite-graph matching problem can be modeled as a 
set-of-distinct-representatives problem. 


In the analysis following Example 4, show that if ¢ and ¢, are in P, then 
vertex (f;, ;) 1s in P. 

Prove Theorem 2 using Exercise 34 in Section 4.2, which applies directly 
to the bipartite network, not the augmented a—z matching network. 


Prove Theorem 2 for complete Y-matchings (without simply interchanging 
the roles of X and Y). By symmetry, the same condition is required, but 
the set A in the reproof is chosen differently from the A in the text’s proof. 
Let 6(G) = max,cx(|A| — |R(A)|. 6(G) is called the deficiency of the bipartite 
graph G = (X, Y, E) and gives the worst violation of the condition in 
Theorem 2. Note that &(G) = 0 because A = @ is considered a subset of 
X. 


(a) Use Exercise 14 to prove that a maximal matching of G has size 
|X| — &G). 

(b) Given a maximal matching of size t = |X| — 5&(G) [assume SG) > 
0], describe how the associated minimal edge covering of Theorem 1 
can be used to find an A such that |A] — |R(A)| = &(G). 


(a) Show that the size of the largest independent set of vertices (mutually 
nonadjacent vertices) in G = (X, Y, E) is equal to lY| + &G) (see 
Exercise 17). Describe how to find such an independent set. 

(b) Use part (a) to find such an independent set in Figure 1.3. 


(Due to J. Hopcroft) Suppose each vertex of a bipartite graph G has degree 
2’, for some r. Partition the edges of G into circuits, and delete every other 
edge in each circuit. Repeat this process on the new graph (where each 
vertex now has degree 2’"'), and continue repeating until a graph is obtained 
with each vertex of degree 1. The edges in this final graph constitute a 
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matching of the vertices of G. Show that such a partition of edges into 

circuits exists in each successive graph and can be found in a number of 

steps proportional to the number of edges in the current graph. 
20. Suppose we are given a partial matching to an arbitrary graph (such a 
matching is a set of edges with no common endpoints). 

(a) Prove that a generalization of the interchange method along a path 
alternating between nonmatching and matching edges (described fol- 
lowing Example 1) can be used to increase the size of the partial 
matching until it is a maximal matching. 

(b) Randomly pick a partial matching for the graph in Figure 4.6 and use 
this method to get a maximal matching. 


21. Write a computer program to find a maximal matching and a minimal edge 
cover in a bipartite graph (the graph is input data). 


x 4.4 SUMMARY AND REFERENCES 


In this chapter we presented algorithms for two basic network optimization 
problems: shortest path and maximal flow. Principal emphasis was placed on 
a thorough discussion of maximal flows. We showed how these flows could 
be applied to a wide variety of other network problems. In Section 4.3 we used 
flow models to develop a combinatorial theory of matching. All the material 
about flows in this chapter is discussed in greater detail in the pioneering work 
Flows in Networks by Ford and Fulkerson [2] and in Network Flow Theory by 
Ahuja et al. [1]. We have omitted discussion of the speed of these algorithms. 
The interested reader is referred to Ahuja et al. [1] for efficient implementations 
of the shortest path and maximal flow algorithms. Modern maximal flow algo- 
rithms require less than O(n*) operations for an n-vertex network. 

It is often natural in network flow problems to have costs associated with 
edges so that when many possible maximal flows exist, one can ask for a least- 
cost maximal flow. Such problems are called trans-shipment and transporta- 
tion problems. Similarly, in a matching problem with many solutions (X- 
matchings), one can ask for a least-cost matching. Such a problem is called 
an assignment problem. Efficient algorithms exist for all these minimization 
problems (see [1], [2], or [3]). Furthermore, any flow optimization problem, 
with or without the above-mentioned minimization, is a problem of optimizing 
a linear function of the edge flows subject to linear equalities and inequalities, 
such as the flow constraints (a), (b), and (c) of Section 4.2. Such a constrained 
linear optimization problem is called a linear program. Linear programming 
is a principal tool of operations research, and good algorithms exist for solving 
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linear programs. However, it is much more efficient to solve network problems 
with the network-specific algorithms presented in this chapter. 


1, R. Ahuja, T. Magnanti, and J. Orlin, Network Flow Theory, Algorithms 
and Applications, Prentice-Hall, Englewood Cliffs, NJ, 1993. 


2. L. Ford and D. Fulkerson, Flows in Networks, Princeton University Press, 
Princeton, NJ, 1964. 


3. F. Hillier and G. Lieberman, Introduction to Operations Research, Holden 
Day, San Francisco, 1988. 


PART TWO 
ENUMERATION 


CHAPTER 5 


GENERAL COUNTING 
METHODS FOR 
ARRANGEMENTS 
AND SELECTIONS 


5.1 TWO BASIC COUNTING PRINCIPLES 


In this chapter we discuss counting problems for which no specific theory 
exists. We present a few basic formulas, involving permutations and combina- 
tions, most of which the reader has seen before. Then we examine a variety 
of word problems involving counting and show how they can be broken down 
into sums and products of simple numerical factors. Having read through the 
examples as passive readers, students next must assume the active role of 
devising solutions on their own to the exercises. The first exercises at the end 
of each section are similar to the examples discussed in the section. The later 
exercises, however, have little in common with the examples except that they 
require the same general types of logical reasoning, clever insights, and elemen- 
tary mathematical modeling. Facility with these three basic skills in problem 
solving, as much as an inventory of special techniques, is the key to success 
in most combinatorial applications. 

In order to motivate our problem solving, we mention applications to 
probability and statistics, computer science, operations research, and other 
disciplines. However, the details of the counting problems arising from such 
applications often appear tedious to those not actively working in the area of 
application. So instead, we will base many of the worked-out examples (and 
exercises) on recreational problems, such as poker probabilities. The solution 
of these problems requires the same mathematical skills used in more substantive 
applications. 

This section starts with two elementary but fundamental counting principles 
whose simplicity masks both their power and the ease with which they can be 
misused. 

Rigorous proofs of these two principles for an arbitrary m require mathemat- 
ical induction. Remember that the addition principle requires disjoint sets of 
objects and the multiplication principle requires that the procedure break into 
ordered stages and that the composite outcomes be distinct. 

a 
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The Addition Principle 


If there are r, different objects in the first set, r, objects in the second 
set, ..., and r, objects in the mth set, and if the different sets are 
disjoint, then the number of ways to select an object from one of the 
m sets is ry + rg +... + Pm 


The Multiplication Principle 


Suppose a procedure can be broken into m successive ( ordered) stages, 
with r; outcomes in the first stage, r, outcomes in the second stage, 
..., and r,, outcomes in the mth stage. If the number of outcomes at 
each stage is independent of the choices in previous stages and if the 
composite outcomes are all distinct, then the total procedure has r; X 
ry X +++ Xr, different composite outcomes. 


Example 1 


Professor Mindthumper has 40 students in an algebra class and 40 students in 
a geometry class. How many different students are in these two classes? 

By the addition principle, the answer is 80 students, provided that no 
students are in both classes. Suppose 10 students are in both classes. To obtain 
disjoint sets of students, we categorize students as just in algebra, just in 
geometry, and in both classes. Since 10 students are in both classes, then 
40 — 10 = 30 algebra students are just in algebra. Similarly 30 students are 
just in geometry. Now we can safely use the addition principle to sum the 
numbers of students in these three disjoint sets. The total number of students 
is 30 + 30 + 10 = 70. = 


Example 2 
Two dice are rolled, one green and one red. 


(a) How many different outcomes of this procedure are there? 


(b) What is the probability that there are different values on the two dice (no 
doubles)? 


(a) There are six outcomes of a single die. So, by the multiplication 
principle, there are 6 X 6 = 36 outcomes of the procedure. 

(b) We calculate the probability of no doubles using the probability formula 
for an event: the fraction of outcomes producing the desired event divided by 
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all possible outcomes (see Appendix A3 for details). The denominator in the 
probability fraction is 36, the total number of outcomes [determined in part 
(a)]. To get the numerator—the number of outcomes with no doubles—it is 
useful to think of one die being rolled first, say, the red die. Once the red die 
is rolled, then there are five other permissible values for the green die. So there 
are 6 X 5 = 30 outcomes with no doubles, and the probability of no doubles 
is 30/36 = 5/6. = 


Generalizing the second part of Example 2, we note that the logic of 
counting requires that ‘‘make the two outcomes different’’ really means ‘‘make 
the second outcome different from the first.’’ The number of no-doubles in 
Example 2(b) could also be answered by subtracting the 6 cases of doubles 
from all 36 outcomes, yielding 36 — 6 = 30 outcomes with no doubles. 


Example 3 


There are five different Spanish books, six different French books, and eight 
different Transylvanian books. How many ways are there to pick an (unordered) 
pair of two books not both in the same language? 

If one Spanish and one French book are chosen, the multiplication principle 
says that the selection can be done in 5 X 6 = 30 ways; if one Spanish 
and one Transylvanian book, 5 X 8 = 40 ways; and if one French and one 
Transylvanian book, 6 X 8 = 48 ways. These three types of selections are 
disjoint, and so by the addition principle there are 30 + 40 + 48 = 118 ways 
in all. m= 


The preceding example typifies a basic way of thinking in combinatorial 
problem solving: Always try first to break a problem into a moderate number 
of manageable subproblems. There may be cleverer ways, but if we can reduce 
the original problem to subproblems with which we are familiar, then we are 
less likely to make a mistake. 


Example 4 


How many ways are there to form a three-letter sequence using the letters a, 
b, c, d, e, f: (a) with repetition of letters allowed? (b) without repetition of 
any letter? (c) without repetition and containing the letter e? (d) with repetition 
and containing e? 


(a) With repetition, we have six choices for each letter in the sequence. 
So by the multiplication principle there are 6 X 6 X 6 = 216 three-letter 
Sequences with repetition. 

(b) Without repetition, there are six choices for the first letter. For the 
second letter, there are five choices, the five remaining letters (no matter what 
the first choice was). Similarly for the third letter, there are four choices. Thus 
there are 6 X 5 X 4 = 120 three-letter sequences without repetition. 
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(c) It is often helpful to make a diagram displaying the positions ina 
sequence, even a sequence with just three letters: 


Such a diagram helps focus on choices involving the positions. If the sequence 
must contain e, then there are three choices for which position in the sequence 
is e, as shown in the following three diagrams: 


e e é 


In each diagram, there are 5 choices for which of the other 5 letters (excluding 
e) goes in the first remaining position and 4 choices for which of the remaining 
4 letters goes in the other position. Thus there are 3 5 X 4 = 60 three-letter 
sequences with e. A second approach to solving this problem is given in the 
exercises. 

(d) Let us try the approach used in part (c) when repetition is allowed. As 
before, there are three choices for e’s position. For any of these choices for 
e’s position, there are 6 X 6 = 36 choices for the other two positions, since 
e and the other letters can appear more than once. But the answer of 3 x 
36 = 108 is not correct. 

The Multiplication Principle has been violated because the outcomes are 
not distinct. Consider the sequence 


£€ fe 


It was generated two times in our procedure: once when e was put in the first 
position followed by c e as one of the 36 choices for the latter two positions, 
and a second time when e was put in the last position with e c in the first two 
positions. 

We must use an approach that ensures distinct outcomes. Let us break the 
problem into disjoint cases based on where the first e in the sequence occurs. 
First suppose the first e is in the first position: 


& 


Then there are six choices (including e) for the second and for the third 
positions—6 X 6 ways. 
Next suppose the first e is in the second position: 


Then there are five choices for the first position (cannot be e) and six choices 
for the last position—5 X 6 ways. Finally, let the first (and only) e be in the 
last position: 


& 


noe noe 


There are five choices each for the first two positions —5S X 5 ways. The correct 
answer is thus (6 X 6) + (5 X 6) + (5 X 5) = 91. = 
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The hardest part about solving most counting problems is finding a structure 
in the problem that allows it to be broken into subcases or stages. In other 
words, the difficulty is in ‘‘getting started.’’ At the same time, one must be 
sure that the decomposition into cases or stages generates outcomes that are 
all distinct. 

Most counting problems require their own special insights. Knowing the 
solution to problem A is typically of little help in solving problem B. Learning 
how to devise original solutions to counting problems is what a combinatorics 
course is all about. This skill cannot be acquired in reading textbook examples. 
It is only gained by working many problems oneself. 

The following example illustrates this type of special insight in problem 
solving. 


Example 5 


How many nonempty different collections can be formed from five (identical) 
apples and eight (identical) oranges? 

Readers with some experience in combinatorial problem solving may want 
to break the problem into subcases based on the number of objects in the 
collections. Any one of these subcases can be counted quite easily, but there 
are 13 possible subcases; that is, collections can have 1 or 2 or... up to 13 
pieces of fruit. 

In counting different possibilities, we must concentrate on what makes one 
collection different from another collection. The answer is, the number of apples 
and/or the number of oranges will be different in different collections. Then 
we can characterize any collection by a pair of integers (a, 0), where a is the 
number of apples and o is the number of oranges. 

Now the number of collections is easy to count. There are 6 possible values 
for a (including 0): 0, 1, 2, 3, 4, 5, and 9 possible values for o. Together there 
are 6 X 9 = 54 different collections. (Note that we multiply 6 and 9, not add 
them, because a collection combines any number of apples and any number of 
oranges; we add if we want to count the ways to get some amount of apples 
or some amount of oranges, but not both.) 

Since the problem asked for nonempty collections and one of the possibilit- 
ies allowed was (0,0), the desired answer is 54 — 1 = 53. = 


Here is one piece of advice to consider when you are stuck and cannot get 
started with a problem. Try writing down ina systematic fashion some (a dozen 
or so) of the possible outcomes you want to enumerate. In listing outcomes, 
you should start to see a pattern emerge. Think of your list as being part of 
One particular subcase. Then ask yourself, how many outcomes would your 
list need to include to complete that subcase? Next ask what other (hopefully 
similar) subcases need to be counted. Once a first subcase has been successfully 
analyzed, then other subcases are usually easier. 
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5.1 EXERCISES 


Mb Summary of Exercises The first eight exercises are straightfor- 


ward. Then the exercises become more challenging and require analysis that 
will be different for each problem. For the harder problems, readers must devise 
their own method of solution rather than mimic a method used in one of the 
text’s examples. All exercises should be read carefully two times to avoid 
misinterpretation. 

The word ‘‘between’’ is always used in the inclusive sense, that is, ‘‘integers 
between 0 and 50’’ means 0, 1, 2,..., 49, 50. 

Some problems include possible answers for comment. Try to infer from 
each expression the reasoning that would have generated such an answer. If 
the answer is wrong, point out the mistake in the reasoning. 


1. (a) How many ways are there to pick a sequence of two different letters 
of the alphabet that appear in the word BOAT? In MATHEMATICS? 
(b) How many ways are there to pick first a vowel and then a consonant 
from BOAT? From MATHEMATICS? 
2. (a) How many integers are there between 0 and 50 (inclusive)? 
(b) How many of these integers are divisible by 2? 
(c) How many (unordered) pairs of these integers are there whose differ- 
ence is 5? 


3. A store carries 8 styles of pants. For each style, there are 10 different 
possible waist sizes, 6 pants lengths, and 4 color choices. How many 
different types of pants could the store have? 


4. How many different sequences of heads and tails are possible if a coin is 
flipped 100 times? Using the fact that 2'° = 1024 = 1000 = 10°, give your 
answer in terms of an (approximate) power of 10. 


5. How many four-letter ‘‘words’’ (sequence of any four letters with repeti- 
tion) are there? How many with no repeated letters? 


6. How many ways are there to pick a man and a woman who are not husband 
and wife from a group of n married couples? 


7. Given eight different English books, seven different French books, and 
five different German books: 


(a) How many ways are there to select one book? 
(b) How many ways are there to select three books, one of each language? 


(c) How many ways are there to make a row of three books in which 
exactly one language is missing (the order of the three books makes 
a difference)? 


8. There are four different roads between town A and town B, three different 
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. roads between town B and town C, and two different roads between town 


A and town C. 
(a) How many different routés are there from A to C altogether? 


(b) How many different routes are there from A to C and back (any road 
can be used once in each direction)? 

(c) How many different routes are there from A to C and back in part (b) 
that visit B at least once? 


(d) How many different routes are there from A to C and back in part (b) 
that do not use any road twice? 


. How many ways are there to pick 2 different cards from a standard 52- 


card deck such that: 
(a) The first card is an Ace and the second card is not a Queen? 


(b) The first card is a spade and the second card is not a Queen? (Hint: 
Watch out for the Queen of spades.) 


. How many nonempty collections of letters can be formed from three As 


and five Bs? 


- How many ways are there to roll two dice to yield a sum divisible by 3? 


How many four-letter ‘“‘words’’ (sequences of letters with repetition) are 
there in which the first and last letter are vowels? In which vowels appear 
only (if at all) as the first and last letter? 

Comment on the answers to the second part: (i) 5°21, (ii) 5?26?, (iii) 
21°26’, and (iv) 264 — 21726?. 


- (a) How many different five-digit numbers are there (leading zeros, e.g., 


00174, not allowed)? 
(b) How many even five-digit numbers are there? 
(c) How many five-digit numbers are there with exactly one 3? 


(d) How many five-digit numbers are there that are the same when the 
order of their digits is inverted (e.g., 15251)? 


- How many different numbers can be formed by various arrangements of 


the six digits 1, 1, 1, 1, 2, 3? 


- What is the probability that the top two cards in a shuffled deck do not 


form a pair? 


- (a) How many different outcomes are possible when a pair of dice, one 


red and one white, are rolled two successive times? 


(b) What is the probability that each die shows the same value on the 
second roll as on the first roll? 


(c) What is the probability that the sum of the two dice is the same on 
both rolls? 
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17. 


18. 


19. 


20 


21. 


22. 


23. 


24. 


25. 


(d) What is the probability that the sum of the two dice is greater on the 
second roll? 


A rumor is spread randomly among a group of 10 people by successively 
having one person call someone, who calls someone, and so on. A person 
can pass the rumor on to anyone except the individual who just called. 


(a) By how many different paths can a rumor travel through the group in 
three calls? In n calls? 


(b) What is the probability that if A starts the rumor, A receives the third 
call? 


(c) What is the probability that if A does not start the rumor, A receives 
the third call? 


(a) How many different license plates involving three letters and three 
digits are there if the three letters appear together either at the beginning 
or end of the license? 


(b) How many license plates involving one, two, or three letters and one, 
two, or three digits are there if the letters must appear in a consecutive 
grouping? 


Re-solve the problem in Example 4 of counting the number of three-letter 
sequences without repetition using a, b, c, d, e, f that have an e by first 
counting the number with no e. 


What is the probability that the sum of two randomly chosen integers 
between 20 and 40 inclusive is even (the possibility of the two integers 
being equal is allowed)? 
Comment on the answers: (i) 1/2, (ii) 11/21, and (iii) 117/21’. 
How many three-letter sequences without repeated letters can be made 
using a, b, c, d, e, f in which either e or f (or both) is used? 
Comment on the answers: (i) 3 X 2 X 4 X 3, (ii) 3 X 2X 5 
(iii) @ X 2X 4x 4) — GB X 2 X 4), and (iv) (6 X 5 X 4)- 4X 
2). 
How many ternary (0, 1, 2) sequences of length 10 are there without any 
pair of consecutive digits the same? 


What is the probability that an integer between 1 and 10,000 has exactly 
one 8 and one 9? 
Comment on the answers: (i) 4 X 3/10‘, (ii) 4 * 3 X 8 X 7/10*, and 
(iii) 2 X 4 X 3 X 87/104. 
How many different five-letter sequences can be made using the letters A, 
B, C, D with repetition such that the sequence does not include the word 
BAD—that is, sequences such as ABADD are excluded. 
(a) How many election outcomes are possible with 20 people each voting 
for one of seven candidates (the outcome includes not just the totals 
but also who voted for each candidate)? 


x 4, 
3 x 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37, 


38. 


39. 


40. 


5.1 Two Basic Counting Principles 179 


(b) How many election outcomes are possible if only one person votes 
for candidate A and only one person votes for candidate D? 


There are 15 different apples and 10 different pears. How many ways are 
there for Jack to pick an apple or a pear and then for Jill to pick an apple 
and a pear? 


How many times is the digit 5 written when listing all numbers from 1 to 
100,000? 

Comment on the answers: (a) 4, (b) 5 X 10‘, and (c) 1 + 10 + 
100 + 1000 + 10,000. 


What is the probability that if one letter is chosen at random from the word 
RECURRENCE and one letter is chosen from RELATION, the two letters 
are the same? 


How many four-digit numbers are there formed from the digits 1, 2, 3, 4, 
5 (with possible repetition) that are divisible by 4? 

How many nonempty collections of letters can be formed from n As, n 
Bs, n Cs and n Ds? 


There are 50 cards numbered from 1 to 50. Two different cards are chosen 
at random. What is the probability that one number is twice the other 
number? 


If two different integers between 1 and 100 inclusive are chosen at random, 
what is the probability that the difference of the two numbers is 15? 


If three distinct dice are rolled, what is the probability that the highest 
value is twice the smallest value? 


How many different numbers can be formed by the product of two or more 
of the numbers 3, 4, 4, 5, 5, 6, 7, 7, 72 


How many n-digit binary sequences are there without any pair of consecu- 
tive digits being the same? 


A chain letter is sent to five people in the first week of the year. The next 
week each person who received a letter sends letters to five new people, 
and so on. How many people have received letters in the first five weeks? 


How many ways are there to place two identical rooks in a common TOW 
or column of an 8 X 8 chessboard? an n X m chessboard? 


How many ways are there to place two identical kings on an 8 X 8 
chessboard so that the kings are not in adjacent squares? on ann X m 
chessboard? 


How many ways are there to place two identical queens on an 8 X 8 
chessboard so that the queens are not in a common row, column, or diag- 
onal? 


How many different positive integers can be obtained as a sum of two or 
more of the numbers 1, 3, 5, 10, 20, 50, 82? 
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41. 


42. 


43. 


44. 


45. 


How many ways are there for a man to invite some (nonempty) subset of 
his 10 friends to dinner? 


How many different rectangles can be drawn on an 8 X 8 chessboard (the 
rectangles could have sides of length 1 through 8; two rectangles are 
different if they contain different subsets of individual squares)? 


How many ways are there to place a red checker and a black checker on 
two black squares of a checkerboard so that the red checker can jump over 
the black checker? (A checker jumps on the diagonal from in front to 
behind.) 


Use induction to verify formally: 
(a) The addition principle. 
(b) The multiplication principle. 


On the real line, place n white pegs at positions 1, 2, ..., and n blue 
pegs at positions —1, —2,..., —n (0 is open). Whites move only to the 
left, blues to the right. When beside an open position, a peg may move 
one unit to occupy that position (provided it is in the required direction). 
If a peg of one color is in front of a peg of the other color that is followed 
by an open position (in the required direction), a peg may jump two units 
to the open position (the jumped peg is not removed). By a sequence of 
these two types of moves (not necessarily alternating between white and 
blue pegs), one seeks to get the positions of the white and blue pegs 
interchanged. (See the article on this game in Mathematics Teacher, January 
1982.) 


(a) Play this game for n = 3 and n = 4. 


(b) Use a combinatorial argument to show that, in general, n’ + 2n moves 
(unit steps and jumps) are required to complete the game. 


5.22 SIMPLE ARRANGEMENTS 
AND SELECTIONS 


A permutation of n distinct objects is an arrangement, or ordering, of the n 
objects. An r-permutation of n distinct objects is an arrangement using r of 
the n objects. An r-combination of n distinct objects is an unordered selection, 
or subset, of r out of the n objects. We use P(n, r) and C(n, r) to denote the 
number of r-permutations and r-combinations, respectively, of a set of n objects. 
From the multiplication principle we obtain 


P(n,2) = n(n — 1), P(n, 3) = n(n — 1)(n — 2), 
P(n,n) = n(n —-1)n—-2)X+++X3X2X1 
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In enumerating all permutations of n objects, we have n choices for the 
first position in the arrangement, n — 1 choices (the n — 1 remaining objects) 
for the second position, . . . , and finally one choice for the last position. Using 
the notation n! = n(n — 1)(n — 2)... X 3 X 2 X 1 (nl is said ‘‘n factorial’’), 
we have the formulas 


P(n, n) = ni 
and 


n! 
(n—r)! 


P(n,r) = n(n - 1)(n-—2)X+++X [n-(r-1D] = 


Our formula for P(n, r) can be used to derive a formula for C(n, r). All 
r-permutations of n objects can be generated by first picking any r-combina- 
tion of the n objects and then arranging these r objects in any order. Thus 
P(n, r) = C(n, r) X P(r, r), and solving for C(n, r) we have 


P(n,r) = ail(n—r)t _ n! 


C(n, r) = P(r,r) r! = rin —r)! 


The numbers C(n, r) are frequently called binomial coefficients because of 
their role in the binomial expansion (x + y)". We study the binomial expansion 
and identities involving binomial coefficients in Section 5.5. It is common 


. < . n “e 29 
practice to write the expression C(n, r) as e and to say ‘‘n choose r’’ [we 


usually write C(n, r) in this book]. 

Note that C(n, r) = C(n, n — r)—the number of ways to pick a subset of 
r out of n distinct objects equals the number of ways to throw away a subset 
of n — r. 


Example 1 


How many ways are there to rank n candidates for the job of chief wizard? If 
the ranking is made at random (each ranking is equally likely), what is the 
probability that the fifth candidate, Gandalf, is in second place? 

A ranking is simply an arrangement, or permutation, of the n candidates. 
So there are n! rankings. Intuitively, a random ranking should randomly position 
Gandalf, and so we would expect that he has probability 1/n of being second, 
or being in any given position. (Equivalently, each of the n candidates should 
have the same probability of being in second place.) Formally, we calculate 
the probability of Gandalf being second using the probability formula for an 
event: the fraction of outcomes producing the desired event divided by all 
possible outcomes (see Appendix A3 for details). 


bsp : ; 
Prob(Gandalf second) = Be ee IU anal second Tea ee second 
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In arrangement problems we have the option of either picking which item 
goes in the first position, then which item goes in the second position, and so 
on, or picking which position to choose for the first item, which position to 
choose for the second item, and so on. In the Gandalf-second problem, we pick 
which positions for the items: first put Gandalf in second place—1 way—and 
then put the remaining wizards in the remaining places—(n — 1)! ways. So 
Gandalf is second in (n — 1)! rankings, and Prob(Gandalf second) = (n — 1)!/ 
n! = I/n, as expected. = 


Note that to count arrangements where a particular position is fixed—in 
the preceding example, that Gandalf is in second place—we do not count the 
ways to make Gandalf second (there is only one way to put Gandalf in second 
place) but rather count all the ways to arrange the remaining wizards in the 
remaining positions. We count what has not been constrained. 


Example 2 


How many ways are there to arrange the seven letters in the word SYSTEMS? 

The difficulty of multiple Ss can be avoided by picking the positions that 
the other four letters E, M, T, Y will occupy in the seven-letter arrangement, 
and then the 3 Ss will fill the remaining three positions in one way. There are 
7 possible positions for E, 6 for M, 5 for T, and 4 for Y. Thus there are P(7, 
4) = 7!/3! = 840 arrangements. (A general formula for counting arrangements 
with repeated objects is given in the next section.) 

Here again we counted arrangements in terms of where successive items 
should be placed rather than what should go in successive positions. m= 


Example 3 


(a) How many 5-card hands (subsets) can be formed from a standard 52-card 
deck? 


(b) If a 5-card hand is chosen at random, what is the probability of obtaining 
a flush (all 5 cards in the hand are in the same suit)? 


(c) What is the probability of obtaining 3, but not 4, Aces? 


(a) A 5-card hand is a subset of 5 cards, and so there are C(52, 5) = 52!/ 
(47!5!) = 2,598,960 different 5-card hands. 

(b) To find the probability of a flush, we need to find the number of 5- 
card subsets with all cards of the same suit. There are 4 suits, and a subset of 
5 cards from the 13 cards in a suit can be chosen in C(13, 5) = 13!/(5!8!) = 
1287 ways. So there are 4 X 1287 = 5148 flushes, and 


5148 


Prob(5-card hand is a flush) = 2.598.960 


= 0.00198 (= 0.2%) 
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(c) To count the number of hands with exactly 3 Aces, we must pick 3 of 
the 4 Aces—done in C(4,3) = 4 ways—and then fill out the hand with 2 cards 
chosen from the 48 non-Ace cards—done in C(48, 2) = 1128 ways. So there 
are 4 X 1128 = 4512 hands with exactly 3 Aces, and 

4512 


Prob(5-card hand has exactly 3 Aces) = 2598960 ~ .00174 » 


Note that to count hands with 3 Aces in Example 3, we implicitly used 
the multiplication principle to multiply the ways to pick 3 Aces times the ways 
to fill out the hand with 2 non-Ace cards. However, a hand is an unordered 
collection and by ordering it into two parts, we might generate two outcomes that 
are really the same set, violating the distinctness condition of the Multiplication 
Principle. In this problem, hands could safely be decomposed into an Aces part 
and a non-Aces part. The next example illustrates this disjointness issue more 
fully. 


Example 4 


A committee of k people is to be chosen from a set of 7 women and 4 men. 
How many ways are there to form the committee if: 


(a) The committee has 5 people, 3 women and 2 men. 


(b) The committee can be any positive size but must have equal numbers of 
women and men. 


(c) The committee has 4 people and one of them must be Mr. Baggins. 
(d) The committee has 4 people and at least 2 are women. 


(e) The committee has 4 people, two of each sex, and Mr. and Mrs. Baggins 
cannot both be on the committee. 


(a) Applying the Multiplication Principle to disjoint collections, we can 
count all committees of 3 women and 2 men by composing all subsets of 3 
women with all subsets of 2 men, C(7, 3) X C(4, 2) = 35 X 6 = 210 ways. 

(b) To count the possible subsets of women and men of equal size on the 
committee, we must know definite sizes of the subsets. That is, we must break 
the problem into the four disjoint subcases of: 1 woman and | man, 2 each, 3 
each, and 4 each (there are only 4 men available). So the total number is the 
sum of the possibilities for these four subcases, [C(7, 1) X C(4, 1)] + [C(7, 
2) X C(4, 2)] + [C(7, 3) X C(4, 3)] + [C(7, 4) x C14, 4] = 7 X 4 + 21 X 
6+ 35 X 4+ 35 X 1 = 329. 

(c) If Mr. Baggins must be on the committee, this simply means that the 
problem reduces to picking 3 other people on the 4-person committee from the 
remaining 10 people (7 women and 3 men). So far we have made all selections 
from the set of men or the set of women. Now the other three people must be 
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chosen from the set of all remaining people, and so the answer is C(10, 3) = 
120. (It is easy to get in a mindset where one uses the same sets in the current 
problem that one used in the previous problem—problem solvers must always 
be alert to this trap.) 

(d) One approach is to pick 2 women first, C(7, 2) = 21 ways, and then 
pick any 2 of the remaining set of 9 people (5 women and 4 men). However, 
counting all committees in this fashion counts some outcomes more than once, 
since any woman in one of these committees could be chosen as either one of 
the first 2 women or one of the 2 remaining people. For example, if W; denotes 
the ith woman and M, the ith man, then (W,, W3;) composed with the 2 remaining 
people (W>, M3) yields the same set as (W,, W2) composed with (W;, Ms). 

A correct solution to this problem must use a subcase approach, as in part 
(b). That is, break the problem into three subcases that specify exactly how 
many women and how many men are on the committee: 2 women and 2 men, 
3 women and 1 man, and 4 women. The answer is thus [C(7, 2) xX C(4, 
2)] + [C(7, 3) X C(4, 1)] + CO, 4 = 21 XK 6 + 35 X 4 + 35 = 301. 

(e) We need to recast the condition ‘‘Mr. and Mrs. Baggins cannot both 
be on the committee’’ into several subcases in which we know exactly which 
of Mr. and Mrs. Baggins is on the committee. Note that the possibility of 
neither Mr. nor Mrs. Baggins’s being on the committee satisfies the given 
condition. There are three subcases to consider. 

The first subcase is, Mrs. Baggins is on the committee and Mr. Baggins 
is not. Then 1 more woman must be chosen from the remaining 6 women and 
2 more men must be chosen from the remaining 3 men (Mr. Baggins is 
excluded). This can be done in C(6, 1) X C(3, 2) = 6 X 3 = 18 ways. The 
other two cases are: Mrs. Baggins is off and Mr. Baggins is on, which by a 
similar argument yields C(6, 2) X C(3, 1) = 15 X 3 = 45 ways; and neither 
is on the committee, C(6, 2) X C(3, 2) = 15 X 3 = 45 ways. The total answer 
is 18 + 45+ 45 = 108. 

An easier solution to this problem can be obtained by taking a complemen- 
tary approach. We can consider all C(7, 2) X C(4, 2) 2-women—2-men commit- 
tees and then subtract the forbidden committees that contain both Bagginses. 
The forbidden committees are formed by picking one more woman and one 
more man to join Mr. and Mrs. Baggins—done in C(6, 1) X C(3, 1) ways. 
This approach yields a simpler formula for the same answer: 21 X 6 — 6 X 
3 = 108. = 


The problem of counting the same outcome twice, which arose in part (d) 
of Example 4, arises in many guises. The following principle should help the 
reader avoid this problem. 


The Set Composition Principle 


Suppose a set of distinct objects is being enumerated using the Multiplication 
Principle, multiplying the number of ways to form some first part of the set 
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by the number of ways to form a second part (for a given first part). The 
members of the first and second parts must be disjoint. Another way to express 
this condition is: Given any set S thus constructed, one must be able to tell 
uniquely which elements of S are in the first part of S and which elements are 
in the second part. 


In Example 4(a), the part W of 3 women and the part M of 2 men are 
disjoint, and so by the set composition principle the number of committees 
with 3 women and 2 men is the size of W times the size of M. In the ‘‘committee 
with at least two women’’ problem in Example 4(d), the method of choosing 
two of the seven women first and then picking any remaining two people 
(women or men) violates the Set Composition Principle because the members 
of the two parts are not disjoint—a woman could be in either the first or second 
part. For example, in the committee {W,, W2, W;, M3} it is impossible to say 
which two women are chosen in the first part. The scheme of picking two 
women and then any two remaining people generates this committee in C(3, 
2) different ways, namely, all compositions of: (i) two of women W,, W2, Ws: 
and (ii) M; plus the woman in W,, W2, W; not chosen in (i). 


Example 5 


How many different 8-digit binary sequences are there with six 1s and two 0s? 

What distinguishes one 8-digit binary sequence with six 1s from another 
such sequence? Obviously, the answer is the positions of the six 1s (or the two 
Os). Though this problem initially reads as an arrangement problem, what must 
be counted is the different possible positions of the six 1s, that is, different 
possible subsets of six of the eight positions in the binary sequence. (A sequence 
with 1s in positions 1, 2, 3, 4, 5, 6 is the same as a sequence with 1s in positions 
6, 5, 2, 3, 4, 1—the order does not matter, just the collection of positions 
involved). So the answer is C(8, 6) = 28. 

We could alternatively have focused on picking a subset of two of the 
eight positions for Os. = 


The following type of counting problem arises frequently in quality-control 
problems. 


Example 6 


A manufacturing plant produces ovens. At the last stage, an inspector marks 
the ovens A (acceptable) or U (unacceptable). How many different sequences 
of 15 As and Us are possible in which the third U appears at twelfth letter in 
the sequence? 

This problem is a binary sequence problem similar to Example 5 except 
now the elements are A and U, rather than 0 and 1. If the third U appears at 
the twelfth letter in the sequence, then the subsequence composed of the first 
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11 letters must have exactly 2 Us (and 9 As). Following the reasoning in 
Example 5, there are C(11, 2) = 55 possible sequences for the first 11 letters. 
There is one way to pick the twelfth letter—it is specified to be U. The remaining 
3 letters in the sequence can be either A or U—2? = 8 possibilities. All together, 
there are 55 X | X 8 = 440 sequences. = 


Example 7 Optional (requires that Section 2.3 was previously covered) 


The chromatic polynomial P,(G) of a graph G is a polynomial in & that gives 
the number of k-colorings of G. What is the chromatic polynomial of: 


(a) A complete graph K; on five vertices (all vertices adjacent to each other)? 
(b) The graph C, of a circuit of length 4? 


(a) In a complete graph, each vertex must be a different color. Thus, 
P.(Ks) = k(k — 1)(k — 2)(k — 3)(k — 4) = P(K, 4), since there are k possible 
choices for the first vertex to be colored; then that color cannot be used again, 
and so the second vertex to be colored has k — 1 choices, and so on. 

(b) Let the vertices on the circuit C, be named x, X%2, x3, X, with edges (x, 
X2), (Xa, X3), (3, %4), (X4, X)). We break the computation of P,(C,) into two cases, 
depending on whether or not x, and x; are given the same color. 

If x, and x; have the same color, there are k choices for the color of these 
two vertices. Then x, and x, each must only avoid the common color of x, and 
x,;—k — 1 color choices each. So the number of k-colorings of C, in this case 
is k(k — 1). 

If x; and x; have different colors, there are k(k — 1) choices for the two 
different colors for x, and then x;. Now x, and x, each have k — 2 color choices. 
So in this case the total number of k-colorings of C, is k(k — 1)(k — 2). 
Combining the two cases, we obtain P,(C,) = k(k — 1)? + k(k — 1)(k — 2). = 


Example 8 


What is the probability that a 4-digit campus telephone number has one or 
more repeated digits? 

There are 10‘ = 10,000 different 4-digit phone numbers. We break the 
problem of counting 4-digit phone numbers with repeated digits into four 
different cases of repetitions: 


(a) All 4 digits the same. 
(b) 3 digits the same, the other different. 
(c) 2 digits the same, the other 2 digits also the same (e.g., 2828). 
(d) 2 digits the same, the other 2 digits each different (e.g., 5105). 
(a) There are 10 numbers formed by repeating one of the 10 digits four 
times. 


(b) One way to decompose the process of generating these case-(b) num- 
bers is as follows (several other decompositions are possible). First pick which 
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digit appears once—10 choices—then where it occurs in the 4-digit number— 
4 choices—and finally which other digit appears in the other three positions— 
9 choices. This yields 10 X 4 X 9 = 360 numbers. 

(c) First pick which two digits are each to appear twice—C(10, 2) = 45 
choices—and then how to arrange these 4 digits: pick which two positions are 
used by, say, the smaller of the digits—C(4, 2) = 6 ways. This yields 45 x 
6 = 270 numbers. 

(d) First pick which pair of digits appear once—C(10, 2) = 45 choices— 
then pick a position for, say, the smaller of these two digits and a position for 
the larger digit—4 x 3 = 12 choices—and finally pick which other digit 
appears in the remaining two positions—8 choices. This yields 45 X 12 X 
8 = 4320 numbers. 


In sum, there are 10 + 360 + 270 + 4320 = 4960 4-digit phone num- 
bers with a repeated digit. The probability of a repeated digit is thus 4960/ 
10,000 = 0.496 = 0.5. 

We note that there happens to be a simpler way, using the complementary 
set, to count phone numbers with repeated digits: count numbers with no 
repeated digits. These are just the P(10, 4) = 5040 four-permutations of 
the 10 digits. So the remaining repeated-digit numbers amount to 10,000 — 
5040 = 4960. = 


One point of caution about cases (c) and (d) where two different digits 
both occur once or both occur twice. In case (d), we pick the two digits occurring 
once as an unordered pair in C(10, 2) ways and arrange those digits (and then 
pick the digit to go in the remaining two positions) rather than pick a first digit, 
position it, then pick a second digit, position it (and then pick the digit to go 
in the remaining two positions)—10 <X 4 X 9 X 3 X 8 ways. In this latter 
(wrong) approach, we cannot tell for a telephone number such as 2529 whether 
the 5 was chosen first and put in the second position and then the 9 chosen 
next and put in the fourth position, or whether the 9 was chosen first and put 
in the fourth position and then the 5 chosen next and put in the second position. 
The Set Composition Principle is being violated and each such outcome would 
be counted twice. 


Example 9 


We consider a way of measuring the influence of different players in weighted 
voting. Suppose that in a 5-person regional council there are 3 representatives 
from small towns, call them a, b, c, who each cast one vote, and there are 2 
representatives from large towns, call them D, E, who each cast two votes. 
With a total of 7 votes cast, it takes 4 votes (a majority of votes) in favor of 
legislation to enact it. Suppose that in forming a coalition to vote for some 
legislation, the people join the coalition in order (an arrangement of the people). 
The pivotal person in a coalition arrangement is the person whose vote brings 
the number of votes in the coalition up to 4. For example, in the coalition 
arrangement bDcaE, the pivotal person is c. A measure of the ‘‘power’’ of a 
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person p in the council is the fraction of coalition arrangements in which p is 
the pivotal person. This measure of power is called the Shapley—Shubik index. 

Determine the Shapley—Shubik index of person a and person D in this 
council, that is, determine the fraction of all coalition arrangements in which 
a and D, respectively, are pivotal. (By symmetry, the 1-vote people b and c 
will have the same index as a, and similarly E will have the same index as D.) 

If a is pivotal in a coalition arrangement, then people with (exactly) 3 votes 
must precede a in the arrangement and people with 3 votes must follow a. 
Since there are only two other 1-vote people, the 3 votes preceding a must 
come from one 1-vote person—b or c—and one 2-vote person—D or E. Then 
the beginning of the coalition can be formed in 2 (choice of 1-vote person) X 
2 (choice of 2-vote person) X 2 (whether 1-vote or 2-vote person goes 
first) = 8 ways. The remaining 1-vote and remaining 2-vote person will follow 
a, with 2 ways to arrange them. In total there are 8 X 2 = 16 coalition 
arrangements in which a is pivotal. There are 5! = 120 arrangements in all, 
and so the Shapley—Shubik index of a is 16/120 = 4/30. 

If D is pivotal in a coalition arrangement, there can be people with 2 or 3 
votes preceding D. Suppose there are 2 votes before D and 3 votes after D. 
Either the arrangement starts with 2 of the three 1-vote people—arranged 3 X 
2 ways—then D, followed by the other 1-vote person and E in either order— 
2 ways—or the arrangement starts with E, then D, followed by an arrangement 
of the three 1-vote people—3! ways. In total, there are (3 X 2 X 2) + 3! = 
18 arrangements with 2 votes before D and 3 votes after D. By interchanging 
the people before D with the people after D in these arrangements, we obtain 
the arrangements with of 3 votes before D and 2 votes after D. So there are 
18 of the latter arrangements. In total, there are 18 + 18 = 36 arrangements 
in which D is pivotal, and so D’s Shapley—Shubik index is 36/120 = 9/30. 

Observe that a 2-vote person has an index 2% times the size of a 1-vote 
person. = 


We close this section by noting that for large values of n, n! can be 
approximated by the number s, = V27n (n/e)", where e is Euler’s constant 
(e = 2.718 ...). This approximation is due to Stirling and its derivation is 
given in most advanced calculus texts (see Buck [1]). The error |n! — s,| 
increases as n increases, but the relative error |n! — s,|/s, is always less than 
1/11n. The following table gives some sample values of n!, s,, and |n! — s,|/ 
Sis 


n n! Sn In! — s,\/s, 
1 1 922 .085 
2 2 1.919 .042 
5 120 118.02 .017 
10 3,628,800 3,598,600 .008 
20 2.433 X 108 2.423 x 1038 .004 


100 9.333 X 10°” 9.328 x 10'” .0005 
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5.2 EXERCISES 


yy Summary of Exercises As in the previous section, most of 


these exercises require individual analysis, different for each problem. Remem- 
ber to read problems carefully to avoid misinterpretation. Pay special attention 
to whether a problem involves arrangements or subsets. Note that the problems 
assume that people are distinct objects (no identical people). 


1. How many ways are there to arrange the cards in a 52-card deck? 


2. How many different 10-letter ‘‘words’’ (sequences) are there with no 
repeated letters formed from the 26-letter alphabet? 


66 


3. How many ways are there to distribute nine different books among 15 
children if no child gets more than one book? 
Comment on the answers: (i) C(15, 9), (ii) 15°, (iii) C(15, 9)9!. 
4. How many ways are there to seat five different boys and five different girls 
along one side of a long table with 10 seats? How many ways if boys and 
girls alternate seats? 


5. How many arrangements are there of the 8 letters in the word VISITING? 


6. How many ways are there to pick a subset of 4 different letters from the 
26-letter alphabet? 

7. How many ways are there to pick a 5-person basketball team from 10 
possible players? How many teams if the weakest player and the strongest 
player must be on the team? 

8. There are 10 white balls and 5 red balls in an urn. How many different 
ways are there to select a subset of 5 balls, assuming the 15 balls are 
different? What is the probability that the selection has 3 whites and 2 
reds? 

9. If a coin is flipped 10 times, what is the probability of 8 or more heads? 

10. (a) On a 10-question test, how many ways are there to answer exactly 
eight questions correctly? 
(b) Repeat part (a) with the requirement that the first or second question, 
but not both, are answered correctly. 


(c) Repeat part (a) in the case that three of the first five questions are 
answered correctly. 


11. How many n-digit ternary (0, 1, 2) sequences with exactly nine Os are 


there? 

12. What is the probability that a five-card poker hand has the following? 
(a) Four Aces (d) A full house (three of a kind and a pair) 
(b) Four of a kind (e) A straight (a set of five consecutive 


values) 


(c) Two pairs (not four of a (f) No pairs (possibly a straight or flush) 
kind or a full house) 
ea et ee RO ck ee P ee eine RO Pan TOE TREN Te EE On PN ERY PSNR PG BORN Le ee ee ee Pe AN GP 
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13. 


14. 


15. 


16 


17. 


18. 


19. 


20. 


21. 


How many ways can a committee be formed from four men and six women 
with: 
(a) At least two men and at least twice as many women as men? 


(b) Four members, at least two of whom are women, and Mr. and Mrs. 
Baggins cannot both be chosen? 


There are eight applicants for the job of dog catcher and three different 
judges who each rank the applicants. Applicants are chosen if and only if 
they appear in the top three in all three rankings. 
(a) How many ways can the three judges produce their three rankings? 
(b) What is the probability of Mr. Dickens, one of the applicants, being 
chosen in a random set of three rankings? 
Comment on the answers for part (b) (numerator only): (i) 3(” — 
1)!, Gi) [31 — 3)!P, Git) 3l~@ — IP. 

There are six different French books, eight different Russian books, and 
five different Spanish books. How many ways are there to arrange the 
books in a row on a shelf with all books of the same language grouped 
together? 
If 13 players are each dealt 4 cards from a 52-card deck, what is the 
probability that each player gets one card of each suit? 
How many 10-letter (sequences) are there using 5 different vowels and 5 
different consonants (chosen from the 21 possible consonants)? What is the 
probability that one of these words has no consecutive pair of consonants? 
What is the probability that a random 9-digit Social Security number has 
at least one repeated digit? 
What is the probability that an arrangement of a, b, c, d, e, f has 
(a) a and b side-by-side? 
(b) a occurring somewhere before b? 


How many ways are there to pair off 10 women at a dance with 10 out of 

20 available men? 

How many triangles are formed by (assuming no three lines cross at a 

point): 

(a) Pieces of n nonparallel lines; for example, the four lines below form 
four triangles: acd, abf, efd, and ebc? 


(b) Pieces of n lines, m of which are parallel and the others mutually 
nonparallel? 


28. 


29. 


30. 


31. 


32. 
33. 
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. There are three women and five men who will split up into two four-person 


teams. How many ways are there to do this so that there is (at least) one 
woman on each team? 


. A man has n friends and invites a different subset of four of them to his 


house every night for a year (365 nights). How large must n be? 


. How many arrangements of JUPITER are there with the vowels occurring 


in alphabetic order? 
Comment on the answer: (i) 3!4!, (ii) 7!/3!, (iii) C(7, 3)4!. 


. Determine the Shapley—Shubik index of a 1-vote person and a 2-vote 


person in the councils with the following make-up: 

(a) Three 1-vote people and one 2-vote person. 

(b) Two 1-vote people and two 2-vote people (majority is 4). 

(c) Four 1-vote people and three 2-vote people (majority is 6). 
(d) Four 1-vote people and four 2-vote people (majority is 7). 


. How many ‘“‘words’’ can be formed by rearranging INQUISITIVE so that 


U does not immediately follow Q? 


. Suppose a subset of 60 different days of the year are selected at random 


in a lottery. What is the probability that there are 5 days from each month 
in the subset? (For simplicity, assume a year has 12 months with 30 days 
each.) 


Suppose a subset of 8 different days of the year are selected at random. 
What is the probability that each day is from a different month? (For 
simplicity, assume a year has 12 months with 30 days each.) 


What is the probability of randomly choosing a permutation of the 10 digits 

0, 1, 2,..., 9 in which: 

(a) An odd digit is in the first position and 1, 2, 3, 4, or 5 is in the last 
position? 

(b) 5 is not in the first position and 9 is not in the last position? 

(a) What is the probability that a five-card hand has at least one card of 
each suit? 

(b) Repeat for a six-card hand. 


What is the probability that a five-card hand has 
(a) At least one of each of the four values: Ace, King, Queen, and Jack? 
(b) The same number of hearts and spades? 


How many n-digit decimal (0, 1, 2) sequences are there with k 1s? 
What fraction of all arrangements of INSTRUCTOR have: 


(a) Three consecutive vowels? 
(b) Two consecutive vowels? 
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34. 


35. 


36. 


37. 


38. 


39. 


40. 


41 


42. 


43. 


44, 


45. 


46. 


47. 


48. 


If one quarter of all 3-subsets of the integers 1, 2, ..., m contain the 
integer 5, determine m. 


How many ways are there to form an (unordered) collection of 4 pairs of 
2 people chosen from a group of 30 people? 


What fraction of all arrangements of GRACEFUL have: 
(a) F and G appear side by side? 
(b) No pair of consecutive vowels? 


How many arrangements of SYSTEMATIC are there in which each S is 
followed by a vowel (this includes Y)? 


How many arrangements of MATHEMATICS are there in which each 
consonant is adjacent to a vowel? 


(a) What is the probability that & is the smallest integer in a subset of four 
different numbers chosen from 1 through 20 (1 = k = 17)? 


(b) What is the probability that k is the second smallest? 


What is the probability that the difference between the largest and the 
smallest numbers is k in a subset of four different numbers chosen from 
1 through 20 (3 = k = 19)? 


How many ways are there to place eight identical black pieces and eight 
identical white pieces on an 8 X 8 chessboard? How many ways are there 
that are symmetric so that the layout looks the same if the chessboard is 
rotated 180°? 


The U. S. Senate randomly creates a special committee with k Senators. 
What is the probability that the committee has a Senator from Kansas? 


What is the probability that 2 (or more) people in a random group of 25 
people have a common birthday? (This is the famous Birthday Paradox 
Problem.) 


What is the probability that a random arrangement of REPLETE has no 
consecutive Es? 

What is the probability that if a die is rolled a sequence of five times only 
two different values appear? 

A family has two boys and three girls to send to private schools. There 
are five boys’ schools, eight girls’ schools, and three coed schools. If each 
child goes to a different school, how many different subsets of five schools 
can the family choose for their children? 

Given a collection of 2n objects, n identical and the other n all distinct, 
how many different subcollections of n objects are there? 

A batch of 50 different automatic typewriters contains exactly 10 defective 
machines. What is the probability of finding: 

(a) At least 1 defective machine in a random group of 5 machines? 
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(b) At least 2 defective machines in a random group of 10 machines? 


(c) The first defective machine to be the kth machine taken apart for 
inspection in a random sequence of machines? 


(d) The last defective machine to be the kth machine taken apart? 
49. Ten fish are caught in a lake, marked, and then returned to the lake. Two 
days later 20 fish are again caught, 2 of which have been marked. 


(a) Find the probability of 2 of the 20 fish being marked if the lake has 
k fish (assuming the fish are caught at random). 


(b) What value of k maximizes the probability? 


50. How many different Hamilton paths are there in each of the following 
graphs? 


(a) (b) (ce) 


51. How many different 4-colorings do each of the graphs (a) and (b) in 
Exercise 50 have? 


Determine the chromatic polynomial for the following graphs: 
(a) K,, a complete graph on n vertices. 
(b) Any n-vertex tree. 


(c) (d) 


53. How many ways are there for a woman to invite different subsets of three 
of her five friends on three successive days? How many ways if she has 
n friends? 


52 


54. For the given map of roads between city A and city B, 


CTL 


(a) How many routes are there from A to B that do not repeat any road 
(a road is a line segment between two intersections)? 


(b) How many ways are there for two people to go from A to B without 
both ever traversing the same road in the same direction? 
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55. 


56. 


57 


58. 


59. 


60. 


61. 


62. 


63. 


What is the probability that a random five-card hand has: 
(a) Exactly one pair (no three of a kind or two pairs)? 


as {lS 4 48 52 
Comment on the answers: (i) ( 1 ) Xx e x (‘ )/( 5 ), 

(ii) 13 x (5) x 48 X 44 x oof (®), 

52 X 3 X 48 X 44 x 40 

52 X 51 X 50 X 49 X 48 

(b) One pair or more (three of a kind, two pairs, four of a kind, full house)? 


(c) The cards dealt in order of decreasing value? 


(d) At least one spade, at least one heart, no diamonds or clubs, and the 
values of the spades are all greater than the values of the hearts? 


(iii) 


How many ways are there to pick a group of n people from 100 people 
(each of a different height) and then pick a second group of m other people 
such that all people in the first group are taller than the people in the second 
group? 

How many subsets of three different integers between 1 and 90 inclusive 
are there whose sum is: 

(a) An even number? 

(b) Divisible by 3? 

(c) Divisible by 4? 

What is the probability that two different letters selected from MISSISSIPPI 
are the same? That three letters are all different? 

If a coin is tossed eight times, what is the probability of getting: 

(a) Exactly four heads in a row? 

(b) At least four heads in a row? 

How many arrangements of INSTRUCTOR are there in which there are 
two consonants between successive pairs of vowels? 


How many triangles can be formed by joining different sets of three corners 

of an octagon? How many triangles if no pair of adjacent corners is per- 

mitted? 

How many arrangements of five Os and ten 1s are there with no pair of 

consecutive Os? 

(a) How many points of intersection are formed by the chords of an n- 
gon (assuming no three of these lines cross at one point)? 

(b) Into how many line segments are the lines in part (a) cut by the 
intersection points? 
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(c) Use Euler’s formula r = e — v + 2 and parts (a) and (b) to determine 
the number of regions formed by the chords of an n-gon. 


64. Given 14 positive integers, 12 of which are even, and 16 negative integers, 
11 of which are even, how many ways are there to pick 12 numbers from 
this collection of 30 integers such that 6 of the 12 numbers are positive 
and 6 of the 12 numbers are even? 


65. How many triangles are formed by (assuming no three lines cross at a 
point): 
(a) Pieces of three chords of a convex 10-gon such that the triangles are 
wholly within the 10-gon (a corner of the 10-gon cannot be a corner 
of any of these triangles)? 


(b) Pieces of three chords or outside edges of a convex n-gon? 


66. How. many arrangements of the integers 1, 2, ---, are there such that 
each integer differs by one (except the first integer) from some integer to 
the left of it in the arrangement? 


67. A man has seven friends. How many ways are there to invite a different 
subset of three of these friends for a dinner on seven successive nights 
such that each pair of friends are together at just one dinner? 


5.3 ARRANGEMENTS AND SELECTIONS 
WITH REPETITIONS 


In this section we discuss arrangements and selections with repetition—arrange- 
ments of a collection with repeated objects, such as the collection b, a, n, a, 
n, a, and selections when an object can be chosen more than once, such as 
ordering six hot dogs chosen from three varieties. We motivate the formulas 
for these counting problems with the two examples just mentioned. 


Example 1 


How many arrangements are there of the six letters b, a, n, a, n, a? 
Consider a possible arrangement: 


This problem is solved by an extension of reasoning used to solve the problem 
of counting 8-digit binary sequences with six Is (Example 5 in Section 5.2). 
The key is to focus on the subset of positions where the as go and the subset 
of positions where the ms go. For example, the above arrangement is character- 
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ized by having as in positions 2, 5, 6, ns in positions 1, 4, and 4 in position 
3. We count the arrangements by first choosing the subset of three positions 
in the arrangement where the as will go—C(6, 3) = 20 ways—then the subset 
of two positions (out of the remaining three) where the ns will go—C(3, 2) 
= 3 ways—and finally the last remaining position gets the b—1 way. Thus 
there are C(6, 3) X C(, 2) X CC, 1) = 20 X 3 X 1 = 60 arrangements. uo 


Theorem 1 
If there are 7 objects, with r, of type 1, r. of type 2,..., and r,, of type m, 
where r, + r, + +++ + 7, = n, then the number of arrangements of these n 
objects, denoted P(n; 1, ro, ... 5 Tm), 1S 
P(nsr, Pereceh Tm) = Clegg met 6s aoe ) coke (" SW Nas a =) 
r; fm) r3 lin 
n} 
= LLL * 
ry'rol 6. Tm! ) 

Proof 1 


First pick r, positions for the first types, then 7, of the remaining positions for 
the second types, and so on. A mathematically precise proof of the ‘‘etc.’’ part 
requires induction (see Exercise 27). The second line of formula (*) is just a 
simplification that results from canceling factorials in the binomial coefficients; 
for example, 


. _(6\(3\(1\_ 61. 3! 1! 6 
r16:3.2.0=(§)(3)(1) =stnaty <= son 


Proof 2 ; 
This proof is similar to our derivation of C(m, r) through the equation P(n, 
r) = C(n, r) X P(r, r). Suppose that for each type, the r; objects of type i are 
given subscripts numbered 1, 2, ..., 7; to make each object distinct. Then 
there are n! arrangements of the n distinct objects. Let us enumerate these n! 
arrangements of distinct objects by enumerating all P(737,, 72, ..., 1m) patterns 
(without subscripts) of the objects, and then for each pattern placing the sub- 
scripts in all possible ways. For example, the pattern baanna can have subscripts 
on as placed in the 3! ways: 


ba, a,nna; bana,nna; ba3a, NNa2 
ba3a,nna, ba, a3nna2 baya3nna, 


For each of these 3! ways to subscript the as, there are 2! ways to subscript 
the ns. Thus, in general a pattern will have r;! ways to subscript the r; objects 
of type 1, r,! ways for type 2, and r,,! ways for type m. Then 


nl = P(nNs Pr, fo, .- 2s Mm )iiho! .. - Tm! 
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or 


PX; rT ---5tm)= T= _, * 


Example 2 


How many different ways to select six hot dogs are possible if there are three 
varieties of hot dog? 

To solve such a selection-with-repetition problem, we recast it as an arrange- 
ment problems as follows. Suppose the three varieties are regular dog, chili 
dog, and super dog. Let a selection be written down on an order form (when 
a person places this order) 


Regular 


Each x represents a hot dog. The request shown on the form above is one 
regular, four chili, and one super. Since the hot dog chef knows that the sequence 
of dogs on the form is regular, chili, super, the request can simply be written 
as x|xxxx|x without column headings. 

Any selection of r hot dogs will consist of some sequence of r xs and two 
|s. Conversely, any sequence of r xs and two |s represents a unique selection: 
the xs before the first | count the number of regular dogs; the xs between the 
two |s count chilis; and the final xs count supers. So there is a one-to-one 
correspondence between orders and such sequences. Counting the number of 
sequences of six xs and two |s is an arrangement-with-repetition problem whose 


é _(8\f/2\_ 8 ; . , 
answer is P(8; 6, 2) = O (3) = bi As discussed in Example 5 of Section 
5.2, counting such sequences of xs and |s is simply a matter of picking the 


subset of positions where the xs (or the |s) go—again, (*) ways. @ 


Theorem 2 


The number of selections with repetition of r objects chosen from n types of 
objects is C(r + n — 1, r). 


Proof 


We make an ‘“‘order form’’ for a selection just as in Example 2, with an x for 
each object selected. As before, the xs before the first | count the number of 
the first type of object, the xs between the first and second |s count the number 
of the second type, ..., and the xs after the (n — 1)-st | count the number of 
the nth type (n — 1 slashes are needed to separate n types). The number of 
sequences with r xs andn — 1 |sisC(r + (n — 1), r). @ 
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Example 3 


Nine students, three from Ms. A’s class, three from Mr. B’s class, and three 
from Ms. C’s class, have bought a block of nine seats for their school’s 
homecoming game. If three seats are randomly selected for each class from 
the nine seats in a row, what is the probability that the three A students, three 
B students, and three C students will each get a block of three consecutive 
seats? 

In probability problems, we seek the number of favorable outcomes divided 
by all outcomes. The first question is, what is the set of all outcomes in this 
problem? They are the P(9; 3, 3, 3) = 9!/3!3!3! = 1680 ways to arrange three 
As, three Bs and three Cs in the row of nine seats. 

If the three students of each class are to sit together, then we want to count 
outcomes that are arrangements of the three blocks, AAA, BBB, and CCC. 
Thus, instead of nine letters, we really are working now with three ‘‘megalet- 
ters.’’ There are 3! = 6 ways to arrange these three megaletters. So the probabil- 
ity that each class sits together is 6/1680. = 


Example 4 


The genetic code of organisms is stored in DNA and RNA molecules as a long 
string of four nucleotides commonly represented (in RNA) by the letters A, C, 
G, U. Short strings of RNA can be ‘‘sequenced’’—the sequence of letters 
determined—by various modern biotech methods. Although the RNA sequence 
for a single gene typically has hundreds or thousands of letters, there exist 
special enzymes that will split a long string into short fragments (which can 
be sequenced) by breaking the string immediately following each appearance 
of a particular letter. 

Suppose a C-enzyme (which splits after each appearance of C) breaks a 
20-letter string into 8 fragments, which are identified to be: AC, AC, AAAUC, 
C, C, C, UAUA, UGGC. Note that each fragment, except the last one on the 
string, must end with a C. How many different strings could have given rise 
to this set of fragments? 

Since the fragment UAUA does not end with a C, it must go at the end 
of the string. The other seven fragments can occur in any order. These fragments 
consist of three Cs, two ACs, one AAAUC, and one UGGC. Treat each fragment 
as a letter, similar to the reasoning in Example 3. Then we must arrange 7 
letters, three of one type, two of a second type, and one each of two other 
types. There are thus P(7;3, 2, 1, 1) = 420 possible arrangements of the 
fragments to form a 20-letter string. = 


If we use an A-enzyme to break the same string into fragments and look 
at all the possible arrangements of these fragments and then do the same with 
a G-enzyme and a U-enzyme, there will normally be only one string that appears 
in all four sets of arrangements. This will be the true original string. More 
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sophisticated variations on this approach are used to determine the RNA 
sequence of entire genes. 


Example 5 


How many ways are there to form a sequence of 10 letters from 4 as, 4 bs, 4 
cs, and 4 ds if each letter must appear at least twice? 

To apply Theorem | we need to know exactly how many as, bs, cs, and 
ds will be in the arrangement. Thus we have to break this problem into a set 
of subproblems that each involves sequences with given numbers of as, bs, 
cs, and ds. There are two categories of letter frequencies that sum to 10 with 
each letter appearing two or more times. The first category is four appearances 
of one letter and two appearances of each other letter. The second is three 
appearances of two letters and two appearances of the other two letters. 

In the first category, there are four cases for choosing which letter occurs 
four times and P(10; 4, 2, 2, 2) = 18,900 ways to arrange four of one letter 
and two of the three others. In the second category, there are C(4,2) = 6 cases 
for choosing which two of the four letters occur three times and P(10; 3, 3, 2, 
2) = 25,200 ways to arrange three of two letters and two of the two others. 
The final answer is 4 X 18,900 + 6 X 25,200 = 226,800 ways. = 


Example 6 


How many ways are there to fill a box with a dozen doughnuts chosen from 
five different varieties with the requirement that at least one doughnut of each 
variety is picked? 

We solve this problem by following the typical procedure of people making 
such a doughnut selection. They would first pick one doughnut of each variety 
and then pick the remaining seven doughnuts any way they pleased. There is 
no choice (only one way) in picking one doughnut of each type. The choice 
occurs in picking the remaining seven doughnuts from the five types. Think 
of placing one doughnut of each type in a box and then covering them with a 
sheet of waxed paper, and then choosing the remaining seven doughnuts. The 
variety in outcomes involves only the seven doughnuts chosen to go on top of 
the waxed paper. So the answer by Theorem 2 is C(7 + 5 — 1, 7) = 330. 
(Note that the set composition principle does not apply here, because the objects 
are not all distinct.) m= 


Example 7 


How many ways are there to pick a collection of exactly 10 balls from a pile 
of red balls, blue balls, and purple balls if there must be at least 5 red balls? 
If at most 5 red balls? 

The first problem is similar to Example 6. We put 5 red balls in a box, 
cover them with waxed paper, and then select 5 more balls arbitrarily (possibly 
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including more red balls). The 5 balls above the waxed paper can be chosen 
in C(S + 3 — 1,5) = 21 ways. 


To handle the constraint of at most 5 red balls, we count the complementary 


set. Of all C110 + 3 — 1, 10) = 66 ways to pick 10 balls from the three colors 
without restriction, there are C(4 + 3 — 1, 4) = 15 ways to choose a collection 
with at least 6 red balls (put 6 red balls in a box, cover with waxed paper, and 
then arbitrarily choose 4 more balls). So there are 66 — 15 = 51 ways to 
choose 10 balls without more than 5 red balls. = 


5.3. EXERCISES 


/ Summary of Exercises Most of these problems are not too 


difficult variations on the section’s examples. Be careful to distinguish between 
arrangements and subsets problems. 


1. 


How many ways are there to roll a die six times and obtain a sequence of 
outcomes with one 1, three 5s, and two 6s? 


How many ways are there to arrange the letters in MISSISSIPPI? 
(a) How many 6-digit numbers can be formed with the digits 3, 5, and 7? 


(b) What fraction of the numbers in part (a) have two 3s, two 5s, and two 
7s? 


. How many ways are there to invite one of three different friends over for 


dinner on six successive nights such that no friend is invited more than 


three times? 
6! 6 
— + a3 6h 2 
3121 * Drove MD 3 a(S), 


Comment on the answers: (i) 


ol()+G)+G)] 


- How many ways are there to pick a collection of 10 coins from piles of 


pennies, nickels, dimes, and quarters? 


. If three identical dice are rolled, how many different outcomes can be 


recorded? 


. How many ways are there to select a committee of 15 politicians chosen 


from aroom full of indistinguishable Democrats, indistinguishable Republi- 
cans, and indistinguishable Independents if every party must have at least 
two members on the committee? If, in addition, no group may have a 
majority of the committee members? 


. Ten different people walk into a delicatessen to buy a sandwich. Four 


always order tuna fish, two always order chicken, two always order roast 
beef, and two order any of the three types of sandwich. 


15. 


16. 


17. 
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(a) How many different sequences of sandwiches are possible? 


(b) How many different (unordered) collections of sandwiches are pos- 
sible? 


. How many ways are there to pick a selection of coins from $1 worth of 


identical pennies, $1 worth of identical nickels, and $1 worth of identical 
dimes if: 


(a) You select a total of 10 coins? 
(b) You select a total of 15 coins? 


15+3-1 
15 , 


Comment on the answers: (i) ( 
a wy 15\(05 = 42=1 
an ¥(19)( (5-8 


wu. {15+3-1\) Wfd5-H+2-1 
ait ( 15 > ( (15 —k) 


. How many ways are there to pick 10 balls from large piles of (identical) 


red, white, and blue balls plus 1 pink ball, 1 lavender ball, and 1 tan ball? 


- How many numbers greater than 3,000,000 can be formed by arrangements 


of 1, 2, 2, 4, 6, 6, 6? 


How many different rth-order partial derivatives does f(x, %2,... 5 Xn) 
have? 
. How many 8-digit sequences are there involving exactly six different digits? 


- How many 9-digit numbers are there with twice as many different odd 


digits involved as different even digits (e.g., 945222123 with 9, 3, 5, 1 
odd and 2, 4 even). 


How many arrangements are possible with five letters chosen from MISSIS- 
SIPPI? 


How many ways are there to select an unordered group of eight numbers 
between | and 25 inclusive with repetition? In what fraction of these ways 
is the sum of these numbers even? 


How many ways are there to split a group of 2n as, 2n Bs, and 2n ys in 
half (into two groups of 3n letters)? (Note: The halves are unordered; there 
is no first half.) 


. How many ways are there to place nine different rings on the four fingers 


of your right hand (excluding the thumb) if: 
(a) The order of rings on a finger does not matter? 
(b) The order of rings on a finger is considered? 


. Show that > P(10; ky, /, ks) = 3, where k,, ky, k; are nonnegative integers 


ranging over alli possible triples such that k; + k, + k; = 10. 
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20. How many arrangements of six Os, five 1s, and four 2s are there in which: 
(a) The first 0 precedes the first 1? 
(b) The first 0 precedes the first 1, which precedes the first 2? 

21. How many arrangements are there of 4n letters, four of each of n types of 
letters, in which each letter is beside a similar letter? 


22. How many ways are there for 10 people to have five simultaneous telephone 
conversations? 


23. When a coin is flipped times, what is the probability that: 
(a) The first head comes after exactly m tails? 
(b) The ith head comes after exactly m tails? 

24. How many arrangements are there of TINKERER with two but not three 
consecutive vowels? 

25. How many arrangements are there of seven as, eight bs, three cs, and six 
ds with no occurrence of the consecutive pairs ca or cc? 

26. How many arrangements of 5 as, 5 Bs, and 5 ys are there with at least 
one £8 and at least one y between each successive pair of as? 

27. (a) Use induction to give a rigorous proof of Theorem 1 (Proof 1). 
(b) Use induction to prove that: 


G3] na ein oe 


5.4 DISTRIBUTIONS 


Generally a distribution problem is equivalent to an arrangement or selection 
problem with repetition. Specialized distribution problems must be broken up 
into subcases that can be counted in terms of simple permutations and combina- 
tions (with and without repetition). General guidelines for modeling distribution 
problems are: Distributions of distinct objects are equivalent to arrangements 
and Distributions of identical objects are equivalent to selections. 


Basic Models for Distributions 


Distinct Objects The process of distributing r distinct objects into n different 
boxes is equivalent to putting the distinct objects in a row and stamping one 
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of the n different box names on each object. The resulting sequence of box 
names is an arrangement of length r formed from n items (box names) with 
repetition. Thus there aren X n X ... X n (rns) = n’ distributions of the r 
distinct objects. If r; objects must go in box i, 1 S i = n, then there are 
P(r3 ri, Yo «+5 Yn) distributions. 


Identical Objects The process of distributing r identical objects into n 
different boxes is equivalent to choosing an (unordered) subset of r box names 
with repetition from among the n choices of boxes. Thus there are C(r + n — 
l,r) = (7 +n -— 1)!/r!(@ — 1)! distributions of the r identical objects. 


Example 1 


How many ways are there to assign 100 different diplomats to five different 
continents? How many ways if 20 diplomats must be assigned to each continent? 

According to the model for distributions of distinct objects, this assignment 
equals the number of sequences of length 100 involving five continental destina- 
tions—5'™ such sequences (think of the diplomats lined up in a row holding 
attaché cases stamped with their destination). The constraint that 20 diplomats 
go to each continent means that each continent name should appear 20 times 
in the sequence. This can be done P(100; 20, 20, 20, 20, 20) = 100!/(20!)° 
ways. w 


Example 2 


In bridge, the 52 cards of a standard card deck are randomly dealt 13 apiece 
to players North, East, South, and West. What is the probability that West has 
all 13 spades? That each player has one Ace? 

There are P(52; 13, 13, 13, 13) distributions of the 52 cards into four 
different 13-card hands, using the same reasoning as in the preceding example. 
Distributions in which West gets all the spades may be counted as the ways 
to distribute to West all the spades—1 way—times the ways to distribute the 
39 non-spade cards among the 3 other hands—P(39; 13, 13, 13) ways. So the 
probability that West has all the spades is 


39! iy ieee ee) ae /(55) 
(13!)7 131)" 13139! 13 

The simple form of this answer can be directly obtained by considering 
West’s possible hands alone (ignoring the other three hands). A random deal 
gives West one of the C(52, 13) possible 13-card hands. Thus, the unique hand 
of 13 spades has probability 1/C(52, 13). 

To count the ways in which each player gets one Ace, we divide the 
distribution up into an Ace part—4! ways to arrange the 4 Aces among the 
four players—-and a non-Ace part—P(48;12,12,12,12) ways to distribute the 
remaining 48 non-Ace cards, 12 to each player. So the probability that each 
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player gets an Ace is 


4148! ij 52! _ 13! 44st, i 52 
aa Gah EE =0.1 
(12!)*7 (131) 12! «52! iS 4 05 = 


Example 3 


How many ways are there to distribute 20 (identical) sticks of red licorice and 
15 (identical) sticks of black licorice among five children? 

Using the identical objects model for distributions, we see that the ways 
to distribute 20 identical sticks of red licorice among five children is equal to 
the ways to select a collection of 20 names (destinations) from a set of 5 
different names with repetition. This can be done C(20 + 5 — 1, 20) = 10,626 
ways. By the same modeling argument, the 15 identical sticks of black licorice 
can be distributed in C(1S + 5 — 1, 15) = 3876 ways. The distributions of 
red and of black licorice are disjoint procedures. So the number of ways to 
distribute red and black licorice is 10,626 X 3876 = 41,186,376. m= 


Example 4 


Show that the number of ways to distribute r identical balls into n distinct 
boxes with at least one ball in each box is C(r — 1, n — 1). With at least r, 


balls in the first box, at least r, balls in the second box, ... , and at least r, 
balls in the nth box, the number is C(r — 1) — 1. — +++ — rm, tn 1], 
n-— 1). 


The requirement of at least one ball in each box can be incorporated into 
the equivalent selection-with-repetition model (as in the doughnut selection 
problem in Section 5.3). One could select a collection of labels of the box 
destinations by putting one label for each of the n boxes on a tray, covering 
these labels with a piece of waxed paper, and the picking the remaining r — 
n labels in all possible ways. 

Alternatively, we can handle this constraint directly in terms of the boxes 
as follows. First put one ball in each box and then put a false bottom in the 
boxes to conceal the ball in each box. Now it remains to count the ways to 
distribute without restriction the remaining r — n balls into the n boxes. This 
can be done in 


[r-—n)t+n-— 1}! 


Cir -—n)tan-i,7r—n)= = C(r — 1,n— 1) ways 


(r—n)'(n- 1)! 

In the case where at least 7; balls must be in the ith box, we first put r; 
balls in the ith box, and then distribute the remaining r — r,; — rr. —--- — 1% 
balls in any way into the n boxes. This can be done in 
CGR EERO) Haale ae eS) 


=C(r-—n-—-nm—-+++— rm) +n—1,1n- 1) ways oa 
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The next counting problem will play a critical role in building generating 
functions in the next chapter. 


Example 5 


How many integer solutions are there to the equation x, + x) + x + x, = 12, 
with x; 2 0? How many solutions with x, = 1? How many solutions with 
x 2=2,m22,%24,x, = 0? 

By an integer solution to this equation, we mean an ordered set of integer 
values for the x;s summing to 12, such as x, = 2, x, = 3, x; = 3, x, = 4. We 
can model this problem as a distribution-of-identical-objects problem or as a 
selection-with-repetition problem. Let x; represent the number of (identical) 
objects in box i or the number of objects of type i chosen. Using either of these 
models, we see that the number of integer solutions is C(12 + 4 — 1, 12) = 
455. 

Solutions with x; = 1 correspond in these models to putting at least one 
object in each box or choosing at least one object of each type. Solutions with 
X, 2 2, x, 2 2, x; = 4, x, = 0 correspond to putting at least two objects in 
the first box, at least two in the second, at least four in the third, and any 
number in the fourth (or equivalently in the selection-with-repetition model). 
Formulas for these two types of distribution problems were given in Example 
4. The respective answers are C(12 — 1, 4 — 1) = 165 and C((12 — 2 — 
2-4)+4-1,4- 1) =C(7, 3) = 35. = 


We have now encountered three equivalent forms for selection with repeti- 
tion problems. 


Equivalent Forms for Selection with Repetition 


The number of ways to select r objects with repetition from n 
different types of objects. 


The number of ways to distribute r identical objects into n distinct 
boxes. 


The number of nonnegative integer solutions to x; + x +++: + 
Xn = Pr. 


It is important that the reader be able to restate a problem given in one of 
the foregoing settings in the other two. Many students find version 2 the most 
convenient way to look at such problems because a distribution is easiest to 
picture on paper (or in one’s head). Indeed, the original argument with order 
forms for hot dogs used to derive our formula for selection with repetition was 
really a distribution model. Version 3 is the most general (abstract) form of 
the problem. It is the form needed in Chapter 6 to build generating functions. 
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We next present four problems that we solve by recasting as problems 
involving distributions of identical or distinct objects. 


Example 6 


How many ways are there to distribute four identical oranges and six distinct 
apples (each a different variety) into five distinct boxes? In what fraction of 
these distributions does each box get a total of two objects? 

There are C(4 + 5 — 1, 4) = 70 ways to distribute four identical oranges 
into five distinct boxes and 5° = 15,625 ways to put six distinct apples in 
five distinct boxes. These two processes are disjoint, and so there are 70 X 
15,625 = 1,093,750 ways to distribute the four identical and six distinct fruits. 

The additional constraint of two objects in each box substantially compli- 
cates matters. To count constrained distributions of distinct objects, we must 
know exactly how many of the distinct objects should go in each box. But in 
this problem, the number of distinct objects that can go in a box depends on 
how many identical objects are in the box. So we deal with the (identical) 
oranges first. There are three possible categories of distributions of the four 
oranges (without exceeding two in any box). 


Case 1 Two (identical) oranges in each of two boxes and no oranges in the 
other three boxes. The two boxes to get the pair of oranges can be chosen in 
C(5, 2) = 10 ways, and the six (distinct) apples can then be distributed in those 
three other boxes, two to a box, in P(6; 2, 2, 2) = 90 ways. So Case | has 
10 X 90 = 900 possible distributions. 


Case 2 Two (identical) oranges in one box, one orange in each of two other 
boxes, and the remaining two boxes empty. The one box with two oranges can 
be chosen C(5, 1) = 5 ways, the two boxes with one orange can be chosen in 
C(4, 2) = 6 ways [or combining these two steps, we can think of arranging 
the numbers 2, 1, 1, 0, 0, among the five boxes in P(5; 1, 2, 2) = 30 ways]. 
Then the two boxes still empty will each get two apples and the two boxes 
with one orange will get one apple. Thus, six distinct apples can then be 
distributed in P(6; 2, 2, 1, 1) = 180 ways, for a total of S X 6 X 180 = 5400 
ways. 


Case 3 One orange in each of four boxes. This can be done in C(5, 4) = 5 
ways, and then the apples can be distributed in P(6; 2, 1, 1, 1, 1) = 360 ways, 
for a total of 5 X 360 = 1800 ways. 


Summing these cases, there are 900 + 5400 + 1800 = 8100 distributions 
with two objects in each box. The fraction of such distributions among all ways 
to distribute the four oranges and six apples is 8100/1,093,750 = .0074—a lot 
smaller than one might guess. (If all 10 fruits were distinct, the fraction with 
two in-each box would be about 50 percent larger. Why?) = 
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Example 7 


“A warlock goes to a store with one dollar to buy ingredients for his wife’s 
Witch’s Brew. The store sells bat tails for 5¢ apiece, lizard claws for 5¢ apiece, 
newt eyes for 5¢ apiece, and calf blood for 20¢ a pint bottle. How many 
different purchases (subsets) of ingredients will this dollar buy? 

The first step is to make our unit of money 5¢ (a nickel). So the warlock 
has 20 units to spend with blood costing four units and the other three items 
each costing one unit. An integer-solutions-of-equation model of this problem 
is 


T+S+E+ 4B = 20, T,S,E,B=0 


The simplest way to handle the fact that B has a coefficient of 4 is first to 
specify exactly how much blood is bought. If one pint is bought, B = 1, then 
we have T + S + E = 16, an equation with Cd6 + 3 — 1, 16) = 153 
nonnegative integer solutions. In general, if B = i, then we have T+ S + E 
= 20 — 4i, an equation with C((20 — 4i) + 3 — 1, 20 — 4i) = C(22 — 4i, 
2) solutions, for i = 0, 1, 2, 3, 4, 5. Summing these possibilities, we obtain 


22 18 14 10 6 2\_ : 
(?) + ) + ee + & + (5) + (3) = 536 different purchases wm 


Many automated production processes are controlled by inputs that are 
binary sequences of Os and 1s. Binary sequences of certain types can often be 
Processed very fast. It is thus useful to know how frequent such ‘‘nice’’ 
sequences are. The sequences in the next example are the type that are often 
handled very fast. 


Example 8» 


How many binary sequences of length 10 are there consisting of a (positive) 
number of 1s, followed by a number of Os, followed by a number of 1s, followed 
by a number of Os? An example of such a sequence is 1110111000. 

We model this problem as a distribution problem as follows. We create 
four distinct boxes, the first box for the initial set of 1s, the second box for the 
following set of Os, and so on. We have 10 identical markers (call them xs) to 
distribute into the four boxes. Each box must have at least one marker, since 
each subsequence of Os or of 1s must be nonempty. By Example 4, the number 
of ways to distribute 10 xs into 4 boxes with no box empty is C(10 — 1, 4 — 
1) = 84. So there are 84 such binary sequences. um 


Example 9 » 


How many arrangements of the letters a, €, t, O, U, X, X, X, X, X, X, X, x, (eight 
xs) are there if no two vowels can be consecutive? 

First we count the ways to fill in the xs before, between, and after the five 
vowel positions. We require at least one x between successive vowels. Think 
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of arrangements of 8 xs and 5 us. The situation can be modeled by creating 
boxes before, between, and after the vs and distributing the xs into the six 
resulting boxes with at least one x in the middle four boxes (to assure no 
two vowels are consecutive). The following pattern illustrates one possible 
distribution of the xs: 


box 1 box 2 box 3 box 4 box 5 box 6 


Initially we put one x in boxes 2, 3, 4, 5. Then we distribute the remaining 
four xs into the six boxes without constraint—C(4 + 6 — 1, 4) = 126 ways. 
This counts all possible patterns of xs and vs (the positions of the us are forced 
by the way we choose the xs). 

Next in each pattern, we arrange a, e, i, 0, u in 5! = 120 ways among the 
five positions with vs. In total, there are 120 X 126 = 15,120 arrangements. = 


We close this section with a table summarizing the different basic types 
of counting problems we have encountered in this chapter. 


Ways to Arrange, Select, or Distribute r Objects from n Items 
or into n Boxes 


Arrangement Combination 
(ordered outcome) (unordered outcome) 
or or 

Distribution of - Distribution of 
distinct objects identical objects 

No repetition P(n, r) C(n, r) 

Unlimited repetition n Cun+r—i,r) 

Restricted Repetition  P(n; r1, 1, ..~ Tm) — 


5.4 EXERCISES 


Mb Summary of Exercises Be careful to distinguish whether a 


problem involves distinct or identical objects. Exercises 23-26 involve problem 
restatement (no actual numerical answers are sought). Exercise 37 presents an 
important alternative approach for analyzing nonconsecutivity problems. 


1. How many\ways are there to distribute 40 identical jelly beans among four 
children: 
(a) Without restrictions? 
(b) With each child getting 10 beans? 
(c) With each child getting at least 1 bean? 
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. How many ways are there to distribute 20 different toys among five chil- 


dren? 

(a) Without restrictions? 

(b) If two children get 7 toys and three children get 2 toys? 
(c) With each child getting 4 toys? 


. In‘a bridge deal, what is the probability that: 


(a) West has four spades, three hearts, three diamonds, and three clubs? 


(b) North and South have five spades, West has two spades, and East has 
one spade? 


(c) One player has all the Aces? 
(d) All players have a (4, 3, 3, 3) division of suits? 


. How many ways are there to distribute five (identical) apples, six oranges, 


and seven pears: 

(a) Without restriction? 

(b) With each person getting at least one pear? 

How many ways are there to distribute 18 chocolate doughnuts, 12 cinna- 


mon doughnuts, and 14 powdered sugar doughnuts among four school 
principals if each principal demands at least 2 doughnuts of each kind? 


How many distributions of 24 different objects into three different boxes 
are there with twice as many objects in one box as in the other two 
combined? 


- How many ways are there to arrange the letters in VISITING with no pair 


of consecutive Is? 


- How many ways are there to arrange the 26 letters of the alphabet so that 


no pair of vowels appear consecutively (Y is considered a consonant)? 


. If you flip a coin 20 times and get 14 heads and 6 tails, what is the 


probability that there is no pair of consecutive tails? 


. How many integer solutions are there to xX + xX + x3 + xy + x5 = 28 


with: 
(a) x, = 0 (b) x; > 0 (c) x = id = 1, 2, 3, 4,5) 


- How many integer solutions are there to xX +X» + x3, = 0 with x, = —5? 


- How many positive integer solutions are there to Xt xy + + xy < 


100? 


- How many ways can a deck of 52 cards be broken up into a collection of 


unordered piles of sizes: 
(a) Four piles of 13 cards? 
(b) Three piles of 8 cards and four piles of 7 cards? 
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14. 


15. 


16. 


17. 


18. 


19 


20. 


21. 


22. 


23. 


How many ways are there to distribute k balls into n distinct boxes (k < 
n) with at most one ball in any box if: 


(a) The balls are distinct? 

(b) The balls are identical? 

How many ways are there to\dishribute three different teddy bears and nine 
identical lollipops to four children: 

(a) Without restriction? 

(b) With no child getting two or more teddy bears? 

(c) With each child getting three ‘‘goodies’’? 

What fraction of all arrangements of EFFLORESCENCE has consecutive 
Cs and consecutive Fs but no consecutive Es? 

How many arrangements of MISSISSIPPI are there with no pair of consecu- 
tive Ss? 

How many ways are there to distribute 15 identical objects into four different 
boxes if the number of objects in box 4 must be a multiple of 3? 


If n distinct objects are distributed randomly into n distinct boxes, what is 
the probability that: 


(a) No box is empty? 

(b) Exactly one box is empty? 

(c) Exactly two boxes are empty? 

How many ways are there to distribute eight balls into six boxes with the 
first two boxes collectively having at most four balls if: 

(a) The balls are identical? 

(b) The balls are distinct? 

In Example 6, if all 10 pieces of fruit were distinct, what would be the 


fraction of outcomes with two pieces of fruit in each box? Why is this 
fraction greater when the pieces of fruit are distinct? 


(a) How many ways are there to make 35 cents change in 1952 pennies, 
1959 pennies, and 1964 nickels? 


(b) In 1952 pennies, 1959 pennies, 1964 nickels, and 1971 quarters? 
State an equivalent distribution version of each of the following arrangement 
problems: 

(a) Arrangements of 8 letters chosen from piles of as, bs, and cs. 

(b) Arrangements of 2 as, 3 bs, 4 cs. 


(c) Arrangements of 10 letters chosen from piles of as, bs, cs, and ds 
with the same number of as and bs. 


(d) Arrangements of 4 letters chosen from 2 as, 3 bs, and 4 cs. 
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24, State an equivalent arrangement version of each of the following distribution 


25. 


26 


27, 


28. 


29. 


problems: 
(a) Distributions of n distinct objects into n distinct boxes. 


(b) Distributions of 15 distinct objects into five distinct boxes with 3 
objects in each box. 


(c) Distributions in 15 distinct objects into five boxes with i objects in the 
ith box, i = 1, 2, 3, 4, 5. 


(d) Distributions of 12 distinct objects into three distinct boxes with at 
most 3 objects in box | and at most 5 objects in box 2. 


State an equivalent distribution version and an equivalent integer-solution- 
of-an-equation version of the following selection problems. 


(a) Selections of six ice cream cones from 31 flavors. 


(b) Selections of five marbles from a group of five reds, four blues, and 
two pinks. 


(c) Selections of 12 apples from four types of apples with at least 2 apples 
of each type. 


(d) Selections of 20 jelly beans from four different types with an even 
number of each type and not more than 8 of any one type. 


State an equivalent selection version and an equivalent integer-solution- 
of-an-equation version of the following distribution problems: 


(a) Distributions of 30 black chips into five distinct boxes. 


(b) Distributions of 18 red balls into six distinct boxes with at least 2 balls 
in each box. 


(c) Distributions of 20 markers into four distinct boxes with the same 
number of markers in the first and second boxes. 


How many election outcomes are possible (numbers of votes for different 
candidates) if there are three candidates and 30 voters? If, in addition, some 
candidate receives a majority of the votes? 


How many election outcomes in the race for class president are there if 
there are five candidates and 40 students in the class and 
(a) Every candidate receives at least two votes? 


(b) One candidate receives at most one vote and all the others receive at 
least two votes? 


(c) No candidate receives a majority of the votes? 
(d) Exactly three candidates tie for the most votes? 


How many numbers between 0 and 10,000 have a sum of digits: 
(a) Equal to 7? (b) Less than or equal to 7? (c) Equal to 13? 
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30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


How many integer solutions are there to the equation x, + %) + %3 + 
x, = 15 with x; = —10? 

How many nonnegative integer solutions are there to the equation 2x, + 
2X, + X3 + X= 12? 

How many nonnegative integer solutions are there to the pair of equations 
xX) $X t+ + +X = 20 and x +x, +%,= 7? 

How many nonnegative integer solutions are there to the inequalities x, + 
xy +++ $+ X56 S 20 and x, + y+ =7? 

How many nonnegative integer solutions are there tox, +H tcc + 
xs = 20: 

(a) With x;= 10? (b) With x; = 8? (c) With x, = 2x2? 
How many ways are there to split four red, five blue, and seven black balls 
among: 

(a) Two boxes without restriction? 

(b) Two boxes with no box empty? 


(a) How many arrangements are there of REVISITED with vowels not in 
increasing order—that is, an I before one (or both) of the Es? 

(b) How many arrangements with no consecutive Es and no consecutive 
Is? 

(c) How many arrangements with vowels not in increasing order and no 
consecutive Es and no consecutive Is? 


This exercise presents an alternative approach to counting arrangements 
with a certain type of object not occurring consecutively. Take Exercise 
7, which seeks arrangements of VISITING with no consecutive Is. Arrange 
the 5 consonants in 5! ways and then pick a subset of 3 positions (with 
no repetition) for Is from among the 6 locations before, between, and after 
the 5 consonants—done in C(6, 3) ways. 


(a) Rework Example 9 using pokes 
(b) Use this approach to solve Exercise 8. 

What fraction of all arrangements of WISCONSIN without any pair of 
consecutive vowels have W adjacent to an I? 

How many bridge deals are there in which North and South get all the 
spades? 


What is the probability in a bridge deal that each player gets at least three 
honors (an honor is an Ace, or King, or Queen, or Jack)? 


How many ways are there to distribute 15 distinct oranges into three 
different boxes with at most 8 oranges in a box? 
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42. How many ways are there to distribute 20 toys to m children such that the 
first two children get the same number of toys if: 


(a) The toys are identical? (b) The toys are distinct? 


43. How many ways are there to distribute 25 different presents to four people 
(including the boss) at an office party so that the boss receives exactly 
twice as many presents as the second most popular person? 


44, How many ways are there to distribute r identical balls into n distinct boxes 
with exactly m boxes empty? 


45. How many subsets of six integers chosen (without repetition) from 1, 2, 
..., 20 are there with no consecutive integers (e.g., if 5 is in the subset, 
then 4 and 6 cannot be in it)? 

46. How many arrangements are there of eight as, six Bs, and seven ys in 
which each a is beside (on at least one side) another a? (Hint: Watch out 
for two clusters of as occurring consecutively.) 

47. How many arrangements are there with n Os and m 1s, with k runs of 0s? 
[A run is a consecutive set (1 or more) of the same digit; e.g., 0001110100 
has three (underlined) runs of Os.] 

48. What is the probability that a random arrangement of a deck of 52 cards 

has exactly k runs of Hearts (see Exercise 47 for a definition of a run)? 

How many binary sequences of length n are there that contain exactly m 

occurrences of the pattern 01? 

50. How many ways are there to distribute 20 distinct flags onto 12 distinct 
flagpoles if: 


49 


(a) In arranging flags on a flagpole, the order of flags from the ground up 
makes a difference. 


(b) No flagpole is empty and the order on each flagpole is counted? 
51. How many ways are there to distribute r identical balls into n distinct boxes 
with the first m boxes collectively containing at least s balls? 


52. How many ways are there to deal four cards to each of 13 different players 
so that exactly 11 players have a card of each suit? 


—=—enr_ Sp 


5.5 BINOMIAL IDENTITIES 


In this section we show why the numbers C(n, r) are called binomial coefficients. 
Then we present some identities involving binomial coefficients. We use three 
techniques to verify the identities: combinatorial selection models, ‘‘block 
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walking,’’ and the binomial expansion. The study of binomial identities is itself 
a major subfield of combinatorial mathematics and we will necessarily just 
scratch the surface of this topic. The interested reader is referred to Riordan 
[5] for a more thorough introduction to combinatorial identities. 

Consider the polynomial expression (a + x)’. Instead of multiplying (a + 
x) by (a + x) and the product by (a + x) again, let us formally multiply the 
three factors term by term: 


(a+ x)(a + x)(a + x) = aaa + aax + axa + axx + xaa + xax + XxX + XXX 


Collecting similar terms, we reduce the right-hand side of this expansion 
to 


at 3a*x + 3ax?+ x3 (1) 


The formal expansion of (a + x)’ was obtained by systematically forming all 
products of a term in the first factor, a or x, times a term in the second factor 
times a term in the third factor. There are two choices for each term in such 
a product, and so there are 2 X 2 X 2 = 8 formal products (8 products were 
obtained above). If we were expanding (a + x)'°, we would obtain 2'” = 1024 
different formal products. 

Now we ask the question, how many of the formal products in the expansion 
of (a + x) contain k xs and (3 — k) as? This question is equivalent to asking 
for the coefficient of a*‘x* in (1). Since all possible formal products of as and 
xs are formed, and since formal products are just three-letter sequences of as 
and xs, we are simply asking for the number of all three-letter sequences with 
k xs and (3 — k) as. The answer is C(3, k) and so the reduced expansion for 
(a + x) can be written as = 


3\ 34 (3) 3) 24 (3),3 
(3) 2 + (3) ar (3) ax + 3 x 
By the same argument, we see that the coefficient of a”‘x‘ in (a + x)" will be 


equal to the number of n-letter sequences formed by k xs and (n — k) as, that 
is, C(n, k). If we set a = 1, we have the following theorem. 


Binomial Theorem 


(+xy= (3) + ()x+ (3) tg (i) Ee ae 


Another proof of this expansion, using induction, is given in the Exercises. 
Just as important as the Binomial Theorem is the equivalence we have estab- 
lished between the number of k-subsets of n objects and the coefficient of x* 
in (1 + x)". We exploit this equivalence extensively in the next chapter. 
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Let us now consider some basic properties of binomial coefficients. The 
most important identity for binomial coefficients is the symmetry identity 


n\ _ n! _f{ on 
a “a= pt” (, e ) (2) 


In words, this identity says that the number of ways to select a subset of k 
objects out of a set of n objects is equal to the number of ways to select a 
group of n — k of the objects to set aside (not to be in the subset). 

The other fundamental identity is 


n n-1 n-1 
(t) ("5") +@=) 
This identity can be verified algebraically. We will give a combinatorial argu- 
ment instead. We classify the C(n, k) committees of k people chosen from a 
set of n people into two categories, depending on whether or not the committee 
contains a given person p. If p is not part of the committee, there are C(n — 
1, k) ways to form the committee from the other n — 1 people. On the other 
hand, if p is on the committee, the problem reduces to choosing the k — 1 
remaining members of the committee from the other n — 1 people. This can 
be done C(n — 1, k — 1) ways. Thus C(n, k) = C(n — 1, kh + C(n — 1, 
k— 1). 
The following example presents another binomial identity that can be 
verified algebraically or by a combinatorial argument. 


Example 1 
Show that 


(7) (") 7 (m)(i 3 :) (4) 


The left-hand side of Eq. (4) counts the ways to select a group of k people 
chosen from a set of n people and then to select a subset of m leaders within 
the group of k people. Equivalently, as counted on the right side, we could first 
select the subset of m leaders from the set of people and then select the 
remaining k — m members of the group from the remaining n — m people. 
Note the special form of Eq. (4) when m = 1: 


(enirt) oe) 


Using Eq. (3) and the fact that C(n, 0) = C(n, n) = 1 for all nonnegative 
n, we can recursively build successive rows in the following table of binomial 
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coefficients, called Pascal’s triangle. Each number in this table, except the 
first and last numbers in a row, is the sum of the two neighboring numbers in 
the preceding row. 


Table of binomial coefficients: kth number in row n is (") 


at ae 
: aver a 
= Paar a 


LOPLI 


n= 


Pascal’s triangle has the following nice combinatorial interpretation, due 
to George Polya. Consider the ways a person can traverse the blocks in the 
map of streets shown in Figure 5.1. The person begins at the top of the network, 
at the spot marked (0, 0), and moves down the network (down the page) making 
a choice at each intersection to go right or left (for simplicity, let “‘right’’ be 
your right as you look at this page, not the right of the person moving down 
the network). We label each street corner in the network with a pair of numbers 
(n, k), where n indicates the number of blocks traversed from (0, 0) and k the 
number of times the person chose the right branch at intersections. Figure 5.1 
shows a possible route from the start (0, 0) to the corner (6, 3). 

Any route to corner (n, k) can be written as a list of the branches (left or 
right) chosen at the successive comers on the path from (0, 0) to (n, &). Such 
a list is just a sequence of k Rs (right branches) and (n — k) Ls (left branches). 
To go to comer (6, 3) following the route shown in Figure 5.1, we have the 
sequence of turns LLRRRL. 

Let s(n, k) be the number of possible routes from the start (0, 0) to corner 
(n, k) (moving down in the network). This is the number of sequences of k Rs 
and (n — k) Ls, and so s(n, k) = C(n, &). Another useful interpretation of 
binomial coefficients is the committee selection model, used above to verify 
Egs. (3) and (4). 

Let us show how our ‘‘block-walking’’ model for binomial coefficients 
can be used to get an alternate proof of identity (3). At the end of a route from 
the start to corner (n, k), a block walker arrives at (n, k) from either corner 
(n — 1, & or corner (n — 1, k — 1). For example, to get to corner (6, 3) in 
Figure 5.1, the person either goes to corner (5, 3) and branches left to (6, 3), 
or goes to corner (5, 2) and branches right to (6, 3). We have thus verified 
identity (3): s(n, ) = s(n — 1, + s(n — 1,k - 1). 


7 
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JER 
‘ cK» 


Figure 5.1 


We now list seven well-known binomial identities and verify three of them 
(the others are left as exercises). These binomial identities are of much practical 
interest. Expressions involving sums and products of binomial coefficients arise 
frequently in complicated real-world counting problems. These identities can 
be used to simplify such expressions. 


()-()+G)-+G)-» 
ete eee 
OC Q)-(h) © 
W)+GY+~-G)-G) ® 
CN) (02) ig 
(54) (559) z 
ae 


(12) 
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Here C(n, r) = 0 if 0 = n < r. These identities can be explained by the 
‘“‘committee’’ type of combinatorial argument used for Eqs. (2), (3), and (4) 
or by ‘‘block-walking arguments.’’ We give a committee argument for Eq. (6). 
Consider the following two ways of counting all subsets (of any size) of a set 
of n people: (a) summing the number of subsets of size 0, of size 1, of size 2, 
and so on—this yields the left-hand side of Eq. (6)—and (b) counting all 
subsets by whether or not the first person is in the subset, whether or not the 
second person is in the subset, and so on—this yields 2X 2X2...xX2(n 
times) = 2", the right-hand side of Eq. (6). 

In general, a combinatorial argument for proving such identities consists 
of specifying a collection counted by the term on the right side, partitioning 
the collection into classes, and showing that the terms in the summation count 
the classes. 

The advantage of proofs involving block walking is that we can draw 
pictures of the ‘‘proof.’’ The picture shows that all the routes to a certain 
corner—this amount is the right-hand side of the identity—can be decomposed 
in terms of all routes to certain intermediate corners (or blocks)—the sum on 
the left-hand side of the identity. 


Example 2 


Verify identity (8) by block-walking and committee selection arguments. 

As an example of this identity, we consider the case where r = 2 and 
n = 6. The corners (k, 2), k = 2, 3, 4, 5, 6, are marked with a * in Figure 5.1 
and comer (7, 3) is marked with an o. Observe that the right branches at each 
starred comer are the locations of last possible right branches on routes from 
the start (0, 0) to corner (7, 3). After traversing one of these right branches, 
there is just one way to continue on to corner (7, 3), by making all remaining 
branches left branches. In general, if we break all routes from (0, 0) to (n + 
1,r + 1) into subcases based on the corner where the last right branch is taken, 
we obtain identity (8). 

We restate the block-walking model as a committee selection: If the kth 
turn is right, this corresponds to selecting the kth person to be on the committee; 
if the kth turn is left, the kth person is not chosen. We break the ways to pick 
r + 1 members of a committee from n + 1 people into cases depending on 
who is the last person chosen: the (r + 1)st, the (7 + 2)nd, ..., the (7 + 1)st. 
If the (r + k + 1)st person is the last chosen, then there are C(r + k, r) ways 
to pick the first r members of the committee. Identity (8) now follows. = 


Example 3 


Verify identity (9) by a block-walking argument. 

The number of routes from (n, k) to (2n, n) is equal to the number of routes 
from (0, 0) to (n, n — k), since both trips go a total of n blocks with n — k to 
the right (and & to the left). So the number of ways to go from (0, 0) to (n, k) 
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and then on to (2n, n) is C(n, k) X C(n, n — k). By (2), C(n, n — k) = C(a, 
k), and thus the number of routes from (0, 0) to (2n, n) via (n, k) is Cn, ky. 
Summing over all k—that is, over all intermediate corners n blocks from the 
start—we count all routes from (0, 0) to (2n, n). This sum equals C(2n, 7), 
and so identity (9) follows. = 


Now we show how binomial identities can be used to evaluate sums whose 
terms are closely related to binomial coefficients. 


Example 4 


Evaluate the sum 1X 2X3+4+2xX3X4+---+(n— 2)(n—- 1)n. 

The general term in this sum (k — 2)(k — Dk i is equal to P(k, 3) = k!/(k 
— 3)!. Recall that the number of zpermutations, and of r-selections differ by 
a factor of r! That is, C(k, 3) = k!/(k — 3)!3! = P(k, 3)/3!, or P(k, 3) = 3!C(k, 
3). So the given sum can be rewritten as 


(3) +3(5) +---+3(8)-21(@) +8) +--+) 


By identity (8), this sum equals 31(” y'): aw 


Example 5 


Evaluate the sum 1? + 2? + 3?+--- + n?, 

A strategy for problems whose general term is not a multiple of C(n, k) 
or P(n, k) is to decompose the term algebraically into a sum of P(n, k)-type 
terms. In this case, the general term k’ can be written as kK? = k(k — 1) + k. 
So the given sum can be rewritten as 


[1 x 0) + 1) + [(2 X 1) +2] + [3X 2)4+ 3] +--+ +[nm—-1) 47] 
=[(22X 1) +(3X2)++--+na—-1D)]+014+24+34+--+-4+n) 


= (262) +2(2) + +2(3)) (i) +()-+(0) 
23") #("3") 


by identity (8). 

Note that as part of Example 5, we showed that 1 + 2+ 3+ --- + 
n=C(n + 1, 2) = 3n(n + 1), a result we verify by induction in Example 1 
of Appendix A.2. = 


There is another simple way to verify identities with binomial coefficients. 
We start with the binomial expansion in the Binomial Theorem. Then we 
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substitute appropriate values for x. The following identities can be obtained 
from the binomial expansion: Eq. (6) by setting x = 1, Eq. (13) by setting 
x = —l, and Eq. (14) by differentiating both sides of the binomial expansion 
and setting x = 1. 


comes ()+Qje-+G) © 
eee 00 aac 


G)GQm-()eGeore on 


(since the sum of both sides is 2”) 


nl +x)jr! = (1) + 2()x + a3 )x +.+-+yn (en (14) 
n(l + 1! =n! = 1(") + 2(5) + a(") fee tn (") (14’) 


5.5 EXERCISES 


Mo Summary of Exercises Combinatorial identities are a well- 


developed field whose surface was barely scratched in this section. The later 
problems in this exercise set go well beyond the level of examples worked in 
this section. Interested students should consult Riordan [5] for assistance with 
the harder problems. 


or 


and so 


1. (a) Verify identity (3) algebraically (writing out the binomial coefficients 
in factorials). 


(b) Verify identity (4) algebraically. 
2. Verify the following identities by block walking. 
(a) (6) (b) (7) (c) (10) (d) (11) 
3. Verify the following identities by a committee selection model. 


(a) (7) (b) (9) (c) (10) (d) (il) (e) (13) 


eo OO ~sN Mw 


14, 
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. Verify the following identities by mathematical induction [Hint: Use 


Eq. (3)]. 
(a) (3) (b) (5) (c) (6) (d) (7) (e) (13’) 


. Prove the binomial theorem by mathematical induction. 

- Show that identity (7) can be obtained as a special case of Eq. (11). 
. Show that C(2n, n) + C(2n, n — 1) = 3C(2n + 2, + 1). 

. If Cin, 3) + Cin + 3 — 1, 3) = Pa, 3), find n. 


. Show by a combinatorial argument that 


n+r-1\f{n\ {n+r—1\f2r 
we AlN) VE) 


. Show that C(k + m + n, k)C(m + n,m) = (k + m + n)\(kimtn!). 


n n n\ _ 
. (a) Show that (") + 6() + o(') =n 


(b) Evaluate 1° + 27+ 33 +--+ + 73, 


. (a) Evaluate 5)12(k + 1)k(k — 1). (Hint: Use Example 4.) 
k=0 


(b) Evaluate 5\(2 + 3k). 
k=0 


(c) Evaluate S* k(n — &). 
k=0 


. (a) Evaluate the sum 


142(t)+--@e p(t) +e ter n(") 


by breaking this sum into two sums, each of which is an identity in 
this section. 


(b) Evaluate the sum 


()-2() +@)+26) + 


By setting x equal to the appropriate values in the binomial expansion (or 
one of its derivatives, etc.) evaluate: 
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(a) > coi’) > -yn(") 
k=0 k=1 , 
- n ~ 1 (n 
(b) 2, Hee - (7) © Qt (7) 
© 3a(") @ See + v{%) 
k=0 k=0 
n i. n 
(d) p> 3 ( ) 


n k 7 n+l = m 
15. Show that 7 (*) - ba 7 Bs _ 


16. Show that >) oe => Gay 
k=1 k k=1 k 


P(m+n,m+t 1) 


+ = 
17. Show that p> P(m + k, m) m+) 


18. (a) Consider a sequence of 2n distinct people in a line at a cashier. Suppose 
n of the people owe $1 and n of the people are due a $1 payment. 
Show that the number of arrangements in which the cashier never goes 
into debt (i.e., at every stage at least as many people have paid in $1 


2n 


: F 2n 
as were paid out $1) is equal to (>") (, 41 


) (Hint: Use a 


symmetry argument in a block-walking type of model to demonstrate 
a one-to-one correspondence between sequences where at some stage 
the cashier goes (at least) $1 into debt and all sequences of 2n people 


in which n + 1 of the people are owed $1.] 


(b) Repeat part (a) with the following constraint: If the first person pays 
$1, how many arrangements of the people are possible in which the 
cashier always has a positive amount of money until the last person 


in line (who is owed $1)? 


19. Show that (3) < @ + (5) cee (")] : > (2) 


20. Find the value of & that maximizes 


“(9 es)RS9 


21. Evaluate > nal 
vatuate 5 (7) k-1 


22. Prove that & + 3(5) fe 5(3) nuts 2(') § 4(") H o(”) ie ie 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 
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Evaluate a — 2(*) + (5) reeset (CH) + (") 


n-1 
Show that >) (— r( i; A :) = 1. (Hint: Rewrite as a binomial coefficient 
k=0 


with k alone in the bottom.) 


“  (2n)!  _ (2n\? 
Show that 4 in — bE — DP k 


n 
: ; : n\{m 
Give a combinatorial argument to evaluate >> ( "( a n=m. 
k=0 


(a) For any given k, show that an integer n can be represented as: 


=(") + (mm) ee (™) where 0 < m;< m<+++<m, 


(b) Prove that for a given k, the representation in part (a) is unique. 


[Hint: Use the fact that 


CoG) 


k =e _ 7 
Show that ("44 5 ae De ae ' 


J=0 


m 2n ps 
Show that 3) 24) _ : DG Kyte sa, 
40 


k=] P(n, k) n = 
() 


“ mi(n — k)! n+1 
Show that a 
_ at 2 olin = seers 


m=n. 


n+1\n 


 {n\(n-k\ 2 fn 
Show that >, Glee =2 (").m<n 


Consider the problem of three-dimensional block walking. Show by a 
combinatorial argument that P(n; i — 1, j,k) + Pty i,j -— 1,k) + 
P(n; i, j,k — 1) = Pant 153, 7, &. 


Give a combinatorial argument to show that 


fC, = ae () show that C,,, = > Cx -2Com—2k- 
= 


(tx tee + xy) =D P(N; ii, iy, , igociix® .. . xt 


where the sum is over all i) + ip +--+ +i, =n,i,20. 
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35. Show for sums over alli, ti te-+ +h Hn = O, that 
(a) > P(n; iy, in, -5 i) = K" 
(b) Sfp te PO hy by ee = PO, Ak 


5.6 GENERATING PERMUTATIONS AND 
COMBINATIONS AND PROGRAMMING 
PROJECTS 


Having spent a whole chapter counting different types of arrangements and 
selections, it is only fair to say a little about actually enumerating, or listing, 
these permutations and combinations. In this section, we present algorithms 
for enumerating all permutations of an n-set and all r-combinations of an n- 
set. With these algorithms, one can write programs to enumerate all the elements 
in various combinatorial sets; we do this in the second half of the section. 

The following examples illustrate how such algorithms are used in opera- 
tions research. Suppose that we have eight different products to manufacture 
in a month at a factory. Given the cost of converting from product i to product 
j, we must find the order of products that minimizes the sum of the conversion 
costs. The easiest solution to this problem is simply to generate all arrangements 
and see which one costs least (a ‘‘branch and bound’ approach to this type of 
problem is given in Section 3.4). 

Suppose that each of 10 workers knows a different subset of 30 skills 
needed on a project, and we must find all groups of 5 workers who collectively 
know all 30 skills. We need to check all 5-subsets of the 10 workers to find 
the ones that satisfy this special condition (this also can be solved by ‘‘branch 
and bound’’ methods). 

There are several different algorithms for enumerating permutations and 
combinations. We will present algorithms that enumerate these collections in 
lexicographic order, that is, we treat an arrangement of objects as a sequence 
of ‘‘letters’? forming a ‘“‘word’’ and list these ‘‘words”’ in alphabetical order. 
This approach requires that the objects be numbered (or otherwise ordered). 
We write A < B, if sequence A is lexicographically smaller than B. A subset 
is treated similarly, with the “‘letters’’ (objects) of each subset arranged in 
increasing alphabetical order. For example, Table 5.1 lists all permutations of 
a, b, c in lexicographic order, and Table 5.2 lists all 3-combinations of 1, 2, 
3, 4, 5 in lexicographic order. 

To list all permutations of an n-set in lexicographic order, we need to be 
able to determine for any given permutation A = a)@)@;... A,, what is the 
next permutation B = b,bnb;... b, in lexicographic order. If a,-1 <n, then 
interchanging the positions of a,-, and a, in A yields a lexicographically larger 
sequence, the desired sequence B. For example, 14325 is followed by 14352. 
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Table 5.1 
1. abc 
2. acb 
bac 
bca 


cab 


Dye Oo 


cba 


Suppose a,-, > a,. Then interchanging the positions of a,-, and a, in A 
yields a lexicographically smaller sequence. Instead, we must turn to a,_». If 
,-2 < a,-;, then the last three digits of A can be rearranged to yield a lexicograph- 
ically larger sequence. To increase the sequence as little as possible, position 
n — 2 should have the next larger ‘‘digit’’ (larger than a,_») chosen from a,_, 
and a,_>. The remaining two of the original last three ‘‘digits’’ go in positions 
n — 1 and n in increasing order. For example, 14352 would be followed by 
14523, while 14253 would be followed by 14325. If a,-. > a,-; (and a,-,; > 
a,), then we must turn to a,_3. 

In general, we start from the right end of the sequence and, moving left, 
look for the first pair a;, a;.; such that a; < a4, (and aj; > jx. > ain3 > 
"++ > a,-; > a,). Then the next larger sequence is obtained by placing a, = 
min{a; | a; > a; and j > i} in position i and placing the remaining digits a;, 
Gisi, ..., Qn (excluding a,) in positions i + 1 through n in increasing order. 


Algorithm for Next Sequence in Lexicographic Listing 
of Permutations 


PROCEDURE NEXTPERMUT(A, n) 
Comment—find lexicographically next permutation of n-element 
array A 
Syntactic Comment—coto’s are used instead of wHie statement for 
simplicity. 
BEGIN 
FOR i <— N-1TO 1 STEP —1 DO 
iF A(i) < A(i + 1) THEN GoTo 1); 
Comment—find min A(h) such that A(h) > Ali) and swap A(h), Ali) 


Table 5.2 
1. 123 6. 145 
2. 124 7. 234 
3, 125 8. 235 
4. 134 9. 245 
5. 135 10. 345 
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1) ror h<—ntoi+| step —1 DO 
ip A(i) < A(h) THEN GOTO 2); 
2) temp — A(i); Ali) — Ath); A(h) — TEMP, 
Comment—complete next arrangement by reversing rest of sequence 
FoR h— i+ 17T0nbo TEM(h) — Alh); 
ror Ah <—i +110 nvoA(A) — TEM(n+ i+ 1-4); 
END NEXTPERMUT. 


Next we show how to generate all r-combinations of an n-set in lexico- 
graphic order. For simplicity, let us assume that the n-set is the set of integers 
1,2,..., 1. Recall that we list the elements in each r-combination in increasing 
order (see Table 5.2). The successor b,b, ... b, to the current r-combination 
a\a,...a, is found by searching from right to left until the first a, # nm —rt+ 
iis found. For this i, set bh} = a; + 1 and b; = b-, + 1, fori<j Sr. 


Algorithm for Next Subset in Lexicographic Listing 
of Combinations 


PROCEDURE NEXTSUBSETIA, n, r) 

Comment—given an r-subset A, find the lexicographically next 

r-subset from 1, 2,3,..., 9. 
BEGIN 

FOR i < TO 1 STEP —1 DO 
iF Ali) < n- 9+ i THEN GOTO 1); 
1) Ali) — Ali) + 1; 
ror he—i+1T0 rpo A(h) — A(h—- 1) + 1; 

END NEXTSUBSET. 


Programming Projects 


Using the preceding algorithms, we can enumerate most combinatorial sets. 
Recall from Section 5.3 that selection with repetition can be modeled as a 
selection without repetition problem and that arrangement with repetition can 
be modeled as a succession of selection problems. 

The following examples give computer programs to list all outcomes in 
two set enumeration problems from preceding sections. 


Example 1 


List all outcomes in Exercise 20 of Section 5.2. How many ways are there to 
pair off 10 women at a dance with 10 out of the 20 available men? 

Let the men be named 1, 2, 3,..., 20. Let B be the array with the current 
subset of 10 men being paired (in 10! ways) with the 10 women. Let A(i) be 
the man paired with the ith women, i = 1, 2,..., 10. Procedures NEXTPERMUT 
and NEXTSUBSET are given earlier in this section. 
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BEGIN 
Comment—construct lexicographically first subset of 10 men 
FOR / <I To 10 do Bi) Kis 
Comment—loop to list all 10-subsets of the 20 men 


FoR j/ — 1TO (33) DO 
10 
BEGIN 
FOR / — 1 To 10 vo A(/) <—B(i); 
Comment—loop to list all arrangements of men in subset A with 
the women 
FOR kK — 1 To 10! Do 
BEGIN 
print out pairing (i, A(i), i= 1,2,..., 10); 
NEXTPERMUTI(A, 10) 
END; 
NEXTSUBSET(B, 20, 10); 
END; 
END. @ 


Example 2 


List all outcomes in Exercise 7 of Section 5.4. How many ways are there to 
arrange the letters in VISITING with no pair of consecutive Is? 

Let OUTCOME(k), k = 1, 2, ..., 8, be the array where the possible 
outcomes will be constructed. Let CONS(k), k = 1,2,...,5, be the permutation 
of the consonants V, S, T, N, G to be used in OUTCOME. 

Let POS(j), j = 1, 2, 3, be the positions in array OUTCOME where Is 
will occur. Think of the 6 locations existing before, between, and after the five 
consonants G, N, S, T, V in VISITING. We pick a subset of 3 of these 6 
locations for the 3 Is (this approach to nonconsecutive elements is discussed 
in Exercise 37 of Section 5.4). Suppose we picked locations 2, 3, 6. Location 
2 for the first I is between the first and second consonant, and so the first I 
goes in position 2. Location 3 for the second I is between the second and third 
consonant, but also after the first I, and so the second I will be in position 4 
(not position 3). Location 6 for the third I comes after all five consonants and 
after the other two Is, and so the third I goes in position 8. In general, the 
position of the first I is the first number in the 3-subset. The position of the 
second I is the second number in the 3-subset plus 1. The position of the third 
Tis the third number in the subset plus 2. 


BEGIN 
Comment—construct lexicographically first 3-subset of 1, 2, 3, 4, 
5,6 

FOR i <— 1103 Do P(/) i: 
Comment—ioop on all 3-subsets of 1, 2, 3, 4, 5, 6. 
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FOR i <— 1 TO 3 DO 


BEGIN 
Comment—convert 3-subset into positions for Is: 
For K<— 170 3 DO POS(k) < Pik) + k- 1; 
Comment—construct lexicographically first permutation 
of consonants 
CONS(1) —‘G’; CONS(2) — ‘N’; CONS(3) —‘S'; 
CONS(4) —‘T’; CONS(5) — “V'; 
Comment—loop on all permutations of the five conso- 
nants 
FOR jf — 7705! DO 
BEGIN 
Comment—construct current OUTCOME 
i1<— 1; f/7<— 1; 
For kK < 1708 D0 
ip kK = POS(i7) THEN 
BEGIN 
OUTCOME(k) —‘I'; 11<—i1+1 
END; 
ELSE 
BEGIN 
OUTCOME(k) — CONS(/1); {1 — j1 + 1; 
END; 
PRINT OUTCOME(i), i= 1, 2,...,8; 
NEXTPERMUT(CONS, 5); 
END; 
NEXTSUBSET(P, 6, 3); 
END; 
END. @ 


5.6 EXERCISES 


Y/ Summary of Exercises The first five exercises use the three 


algorithms presented in this section. The next six exercises involve variants and 
related properties of these algorithms. The last nine exercises are programming 
projects. 

1. Enumerate all arrangements of 1, 2, 3, 4 in lexicographic order. 

2. Enumerate all arrangements of a, c, e, A in lexicographic order. 

3, Enumerate all 4-subsets of 1, 2, 3, 4, 5, 6 in lexicographic order. 
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. Enumerate all 3-subsets of a, d, h, j in lexicographic order. 


. What are the lexicographically first and last elements in: 


(a) The collection of all arrangements of the integers 1, 2,..., 10? 
(b) The collection of all 4-subsets of the 26 letters? 


. Prove by induction that the successor rule given in this section correctly 


finds the next lexicographic element in: 
(a) The collection of all arrangements of an n-set. 
(b) The collection of r-subsets of an n-set. 


7. Devise another algorithm for enumerating all arrangements of an n-set. 


8. Devise another algorithm for enumerating all r-subsets of an n-set. 


10. 


11. 


. (a) Develop a procedure to compute the location of a given arrangement 


in the lexicographic order of all arrangements of 1, 2,..., 7. 


(b) What is the location of 314526 in the lexicographic order of 1, 2,..., 
6? 


(a) Develop a procedure to compute the location of a given r-subset in 
the lexicographic order of all r-subsets of 1, 2,..., n. 


(b) What is the location of {3,4,7} in the 3-subsets of 1, 2,..., 8? 


Devise a method to list all subsets of an n-set in lexicographic order. 


Programming Projects 


12. 


13. 


The following table gives the cost of converting from job i to job j. In 
what order should the four jobs be performed in order to minimize the 
total conversion costs? 


From\To 1 2 


3 4 
5 2: 
4 8 

i 


The following table tells which people know each of a set of trades. Find 
all subsets of four people who know all 10 trades. 
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Person\Trade 1 2 3 4 5 6 67 8 9 10 
1 1 1 oO 1 0 oO oO 1 0 0 
2 1 O 1 0 1 0 1 0 O 1 
3 0 1 0 0 0 41 0 oO 1 1 
4 0 oOo 1 1 1 0 61 1 0 0 
5 0 1 0 1 0 oO 1 0 1 0 
6 1 oO 1 0 1 0 0 O 1 0 
7 1 0 oO 1 Oo 1 0 oO O 1 


14. A salesperson wants to make a tour visiting each of five cities, starting 
and ending at city 1. Use the intercity cost matrix below to find the cheapest 
tour. 


From\To 1 2 3 
1 5 
3 


MANA NIN 


2 
3 
4 
5 


15. Write a program to list all 5-subsets of 1, 2, 3, 4, 5, 6 with repetition. 


16. Write a program to list all arrangements of three as, three bs, and three 
cs. 


17. Write a program to list all distributions of r identical objects into n different 
boxes with between three and six objects in each box. 


18. Write a program to list all integer solutions of x, + x. + +--+: +x, =A 
where: 
(a) x =k (bb) OSx,Sm53m5°°°S2, 


19. Write a program to list all distributions of r distinct objects into n different 
boxes with at least m objects in each box. 


20. Write a program to list all v-subsets of 1,2,.. . , with no pair of consecutive 
integers. 


iy 
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5.7 SUMMARY AND REFERENCES 


This chapter introduced the basic formulas and logical reasoning used in arrange- 
ment and selection problems. There were four formulas for arrangements and 
selections with and without repetition, and there were three principles for 
composing subproblems. But these formulas and principles were just the build- 
ing blocks for constructing answers to dozens of examples and hundreds of 
exercises. These problems required a thorough, logical analysis before one could 
begin to decompose them into tractable subproblems. Such logical analysis of 
possibilities arises often in computer science, probability, and operations 
research. It is the basic methodology of discrete mathematics and the most 
important skill for a student to develop in this course. Note that such logical 
reasoning is the very essence of mathematical model building. 

The n! formula for arrangements was known at least 2500 years ago (see 
David [2] for more details of the history of combinatorial mathematics). Prob- 
lems involving binomial coefficients and the binomial expansion were men- 
tioned in a primitive way in Chinese, Hindu, and Arab works 800 years ago. 
Pascal’s triangle appears in a 14th century work of Shih-Chieh (see text cover). 
The first appearance of the triangle in the West was 200 years later. However, 
it was not until the end of the seventeenth century that Jacob Bernoulli gave 
a careful proof of the binomial theorem. The examination of probabilities related 
to gambling by Pascal and Fermat around 1650 was the beginning of modern 
combinatorial mathematics. The formula for selection with repetition was dis- 
covered soon afterward in the following context: the probability when flipping 


rt+n 
a coin that the nth head appears after exactly r tails is (3) Caant+r-1,r) 


(there are n ‘‘boxes’’ before and between the occurrences of the n heads for 
placing the r tails). The formula for arrangements with repetition was discovered 
around 1700 by Leibnitz in connection with the multinomial theorem (see 
Exercise 36 of Section 5.5). Jacques Bernoulli’s Ars Conjectandi (1713) was 
the first book presenting basic combinatorial methods. The first comprehensive 
textbook on permutation and combination problems was written by Whitworth 
[6] in 1901. 

The problem of systematically enumerating arrangements, selections, and 
other combinatorial collections was not addressed until the advent of modern 
digital computers. One of the first people to work on such enumeration in the 
1950s was Lehmer [3]. 

For further information about algorithms for enumerating permutations and 
combinations, see Reingold et al. [4]. 

See General References (at end of text) for a list of other introductory texts 
on enumeration. 

1. R. Buck, Advanced Calculus, 2nd ed., McGraw-Hill, New York, 1965. 


2. F. David, Games, Gods, and Gambling, Hafner Press, New York, 1962. 
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3. D. Lehmer, ‘‘The Machine Tools of Combinatorics,’’ in Applied Combina- 
torial Mathematics, E. Beckenbach (ed.), John Wiley & Sons, New York, 
1964. 


4, E.Reingold,J. Nievergelt, and N. Deo, Combinatorial Algorithms, Prentice- 
Hall, Englewood Cliffs, NJ, 1977. 


5. J. Riordan, Combinatorial Identities, John Wiley & Sons, New York, 1968. 


6. W. Whitworth, Choice and Chance, 5th ed. (1901), Hafner Press, New 
York, 1965. 


SUPPLEMENT: 
SELECTED SOLUTIONS TO PROBLEMS IN CHAPTER 5 


Section 5.1 


Exercise 13 

(a) How many different five-digit numbers are there (leading zeros, e.g., 00174, 
not allowed)? 

(b) How many even five-digit numbers are there? 

(c) How many five-digit numbers are there with exactly one 3? 


(d) How many five-digit palindromic numbers (numbers that are the same 
when the order of their digits is inverted, e.g., 15251) are there? 


Answer: 


(a) All numbers from 10,000 to 99,999 form the 5-digit numbers. Answer is 
9 x 10 x 10 X 10 x 10. 


(b) For a number to be even, the rightmost position must have an even digit 
(0 or 2 or 4 or 6 or 8). Answer: 9 X 10 X 10 X 10 X 5. 


(c) If leftmost digit is 3, there are 9* numbers. If any other digit is 3, 8 X 9° 
numbers. Total is 9* + 4 X (8 X 9°). 


(d) A palindromic number must have the same digit in the leftmost and 
rightmost position and similarly for the second-leftmost and second- 
rightmost positions. Then the problem becomes one of counting the ways 
to pick values for 3 leftmost digits (the 2 right digits are then forced by 
palindromic symmetry)—9 X 10 X 10 ways. 


Exercise 27 


How many times is the digit 5 written when listing all numbers from 1 to 
100,000? 
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Answer: 


It is simpler to answer this question for numbers between 0 and 99,999 (adding 
0 and omitting 100,000 makes no difference since neither contains a 5). We 
thus ask how many times 5 in appears in writing 5-digit sequences (5-digit 
numbers with leading Os allowed). Consider the problem, How many times 
does a 5 occur in the third (middle) position in these 5-digit sequences, that 
is, how many 5-digit sequences are there with a 5 in the third position? The 
answer is 10*. For all 5 positions in the sequence, the answer is 5 X 10%. 


Exercise 29 


How many four-digit numbers are there formed from the digits 1, 2, 3, 4, 5 
(with possible repetition) that are divisible by 4? 


Answer: 


A number is divisible by 4 if and only if the number formed by its two rightmost 
digits are divisible by 4. Also to be divisible by 4, the 1s digit (the rightmost 
digit) must be even, in this problem, 2 or 4. Observe that composing any given 
10s digit with the two even Is digits, such as 32, 34, we get two consecutive 
even numbers. Exactly one of every two consecutive even numbers is divisible 
by 4. So to generate a number divisible by 4 using digits 1, 2, 3, 4, 5 there 
can be any of these digits in the 1000s, the 100s and 10s positions followed 
by one choice for the 1s position—5* ways. 


Section 5.2 


Exercise 33 


What is the probability that an arrangement of INSTRUCTOR has: 
(a) Three consecutive vowels? 


(b) Two consecutive vowels? 


Answer: 


Total number of arrangements is C(10, 2)XC(8, 2)X6! and so outcomes in 
parts (a) and (b) will be divided by this amount to get the probabilities of these 
events. 

(a) Replace the sequence of 3 consecutive vowels by a special symbol V. Now 
we arrange the 8 letters, C, N, R, R, S, T, T, V. We pick a pair (subset of 
size 2) of positions for the 2 Rs, then a pair of (remaining) positions for 
the Ts, and then arrange the 5 distinct letters—C(8, 2) X C(6, 2) X 5! 
arrangements. Next there are 3! arrangements of the 3 vowels to put in 
place of V. Answer: 3! X C(8, 2) X C(6, 2) x 4! 

Initially proceed as in part (a), now with V as a double vowel and V’ as 
a single vowel. There are C(9, 2) X C(7, 2) X 5! arrangements of C, N, 
R, R, S, T, T, V, V’. There are 3 choices for the vowel to be V’ and 2 


(b 


— 
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arrangements of the remaining vowels for V. In total, 3 X 2 X C(9, 2) X 
C(7, 2) X 5! arrangements. However, if V is followed (immediately) by 
V’ or if V’ is followed by V, we obtain three vowels in a row. Every 
arrangement with three vowels in a row will be generated in two different 
ways, e.g., TIOUCRRTNS arises from T(IO)(U)CRRTNS and 
T(D)(OU)CRRTNS. Subtracting arrangements with 3 vowels in a row [part 
(a)], we have {3 X 2 X C(9, 2) X C(7, 2) X S5!} — (3! x C(8, 2) X Cl, 
2) X 4}. 


Exercise 39 


(a) What is the probability that k is the smallest integer in a subset of four 
different numbers chosen from 1 through 20 (1 = k = 17)? 


(b) What is the probability that k is the second smallest? 


Answer: 


(a) As in most counting problems, the key here is to focus on the numbers 
that remain to be chosen, not on what one is told must be in the subset. 
To be concrete, initially assume k = 8. For 8 to be the smallest number 
in the subset, the other three numbers in the subset must be larger than 
8—chosen in C(20 — 8, 3) ways. For a general k, the probability is 
C(20 — k, 3)/C(20, 4). 

If 8 is the second smallest number in the subset, there is one smaller 
number—chosen in C(8 — 1, 1) ways—and two larger numbers—chosen 
in C(20 — 8, 2) ways. In general, the probability is C(kA — 1, 1) X 
C20 — k, 2)/C(20, 4). 


(b 


— 


Exercise 43 


What is the probability that 2 (or more) people in a random group of 25 people 
have a common birthday? 


Answer: 


The ‘‘trick’’ in this problem is that it is easier to count the probability that no 
one has a common birthday and subtract this probability from 1. We want the 
fraction of possible birthday dates for 25 people in which everyone has a 
different birthday. The denominator, all possibilities of various birthdays for 
the 25 (different) people, is 365°. The numerator, the possibilities where every- 
one has a different birthday, is P(365, 25). The desired probability equals 1 — 
P(365, 25)/365*. (With a calculator or computer, one determines this probability 
to be about .57.) 


Exercise 47 


Given a collection of 27 objects, 7 identical and the other 7 all distinct, how 
many different subcollections of n objects are there? 
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Answer: 


The trick is to decide which of the n distinct objects will be in the collection. 
Any subset can be chosen of distinct objects—C(n, 0) + C(n, 1) + C(a, 2) + 

- + C(n, n) ways—with the remaining elements made up of identical 
objects—done in 1 way (since the objects are identical). An alternative approach 
is to say that we have the choice to use or not use each distinct object—2" 
outcomes. 


Exercise 55 


What is the probability that a random five-card hand has: 
(a) Exactly one pair (no three of a kind or two pairs)? 


(b) One pair or more (three of a kind, two pairs, four of a kind, full house)? 
(c) The cards dealt in order of decreasing value? 


(d) At least one spade and at least one heart and the values of the spades are 
all greater than the values of the hearts? 


Answer: 


The denominator is C(52, 5). 

(a) There are C(13, 1) X C(4, 2) ways to pick a pair (2 cards of the same 
kind). To fill out the rest of the hand, pick one card of a second kind— 
48 ways—then one card of a third kind—44 ways—and finally one card 
of a fourth kind—40 ways. The problem is that this rest-of-hand count is 
ordered; that is, the sequence of choices 7@, QY, 5@ yields the same 
rest-of-hand as 5@, QY, 7&. Divide by 3! to ‘‘un-order’’ this rest-of- 
hand, yielding the answer: {C(13, 1) X C(4, 2) x (48 X 44 x 40/3!)}/ 
C(52, 5). 

Another approach for the rest-of-hand is to pick a subset of 3 other 
kinds that will appear in the rest-of-hand—C(12, 3) ways—and pick a 
card of each of these kinds—4’ ways, yielding {C(13, 1) x C(4, 2) xX 
C(12, 3) X 4°}/C(52, 5). 

(b) Determine the probability of each of the 5 cards being of a different kind 
and subtract this probability from 1: 1 — C(13, 5) X 4°/C(52, 5). 

(c) Any 5-card hand has a 1/5! chance of being dealt in a particular order 
(increasing or otherwise). 

(d) Pick a subset of the 5 kinds (values) that will appear in the hand—C(13, 
5) ways. The lowest k (1 = k = 4) kinds must be hearts and the other 
kinds spades, It remains only to pick the value of K—4 choices: 4 x C(13, 
5). (Wasn’t that sneaky!) 


Exercise 63 


(a) How many points of intersection are formed by the chords of an n-gon 
y Pp ; bY. g 
(assuming no 3 of these lines cross at one point)? 
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(b) Into how many line segments are the lines in part (a) cut by the intersection 
points? 

(c) Use Euler’s formula r = e — v + 2 and parts (a) and (b) to determine the 
number of regions formed by the chords of an n-gon. 


Answer: 


(a) Every subset of four vertices of the n-gon forms a unique intersection point 
generated by the intersection of the two chords joining opposite vertices 
in the subset of four vertices. Thus the answer is C(n, 4). 


(b) The number of chords is C(n, 2) — x (all pairs of vertices are connected 
by chords except for the n edges of the polygon). Each intersection point 
splits two line segments, increasing the number of line segments by 2. 
Then the answer to part (b) is C(n, 2) — n + 2C(a, 4). 


The total number of edges e equals the number of line segments— 
C(n, 2) — n + 2C(n, 4) [from part (b)]—plus the number of edges of the 
polygon—n edges—yielding e = C(n, 2).+ 2C(n, 4). The number of 
vertices v is the number of polygon vertices plus intersection points, v = 
n + C(n, 4). By Euler’s formula, the number r of regions (excluding the 
infinite region) isr = e — v + 1 = [C(m, 2) + 2C(n, 4)] — [n + Ca, 
4)) + 1 = C(n, 2) + CQ, 4) — 1 + 1. 


— 


(c 


Exercise 67 


A man has seven friends. How many ways are there to invite a different subset 
of three of these friends for a dinner on seven successive nights such that each 
pair of friends are together at just one dinner? 


Answer 


We need to find all possible unordered collections of seven 3-subsets such that 
each pair of the seven friends appears in exactly one subset. Consider the 
following table of 3-subsets (an X means that the element of that row is in the 
subset of that column): 


Subsets 


Qnmoaw SS 
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Let the first three 3-subsets be the ones involving friend A. There are 


(8) x (5) x (3)| / 3! = 15 collections of 3-subsets involving A— we 


count all ways to pair off the other 6 friends into 3-subsets containing A, and 
divide by 3! so that the pairings are not ordered. Let the first subset involve B 
as well as A. A general collection of three 3-subsets with A has the form (A, 
B, C’), (A, D’, E’), (A, F’, G’) where C’ is the other friend in the subset with 
A and B, D’ is the (alphabetically) first of the friends not in the first subset, 
E’ is the other friend in the subset with A and D’, and F’ is alphabetically 
earlier than G’. Replacing C, D, E, F, and G by C’, D’, E’, F’, and G’ in the 
above table, the only remaining choice is whether B or C’ forms 3-subsets with 
D’, F’ and E’, G’—2 choices. So there are 15 X 2 collections of seven 3- 
subsets of the seven friends such that each pair appears in one 3-subset. 

Each such collection can be arranged over the seven successive nights in 
7! ways. So the answer is 15 X 2 X 7!. 


Section 5.3 


Exercise 17 


How many ways are there to split a group of 2 as 2n Bs, and 27 ys in half 
(into two groups of 37 letters)? (Note: The halves are unordered; there is no 
first half.) 


Answer: 


Count the ways to select 37 letters from the 3 types of letters and then sub- 
tract outcomes with 2n + 1 or more of one letter—C(3n + 3 — 1, 3n) — 3 X 
Cn — 1) + 3 — 1, — 1). Since each split forms two groups of 3n letters, 
it appears we should divide this count of 3n-letter groups by 2. However, the 
split in which each group consists of n letters of each type contains two cop- 
ies of the same group (this split is not double counted). So the answer is 
al[CGn + 3 — 1,3n) —-3 X C(n- 1) +3-1,n-1)- 1] +1. 


Exercise 20 
How many arrangements of six Os, five 1s, and four 2s are there in which: 


(a) The first O precedes the first 1? 
(b) The first 0 precedes the first 1 which precedes the first 2? 


Answer: 


(a) Position the four 2s among the 15 positions —C(15, 4) ways—then put a 
O in the first of the remaining positions—1 way—and then pick other five 
positions for the remaining Os—C(10, 5) ways. Answer: C(15, 4) X 
C10, 5). 

(b) Put a 0 in the first position—1 way—then pick five other positions for the 
remaining Os—C(14, 5) ways—then put a 1 in the first of the remaining 
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positions—1 way—and then pick four other positions for the remaining 
1s—-C(8, 4) ways: C(14, 5) X C(8, 4). 


Exercise 21 


How many arrangements are there of 4n letters, four of each of n types of 
letters, in which each letter is beside a similar letter? 


Answer: 


There cannot be exactly three consecutive letters the same, because that would 
leave the fourth letter of that type alone. So there are either two in a row or 
four in a row. But four in a row is the same as two consecutive two-in-a-rows. 
So our problem reduces to counting arrangements of 2” objects (two-in-a-rows) 
of n types with 2 of each type. There are 2n!/(2!)" such arrangements. 


Exercise 23 


When a coin is flipped n times, what is the probability that: 
(a) The first head comes after exactly m tails? 


(b) The ith head comes after exactly m tails? 


Answer: 


(a) There are 2” outcomes in all. The sequence of flips begins with m successive 
tails followed by a head. The sequence can be completed in 2"-"*” ways. 
Probability: 2”-"*)/2" = 2-@*D, 

(b) If the ith head comes after exactly m tails, then the first m + (i — 1) flips 
contain m tails and i — 1 heads—C(m + (i — 1), m) ways. The remaining 
flips are unrestricted—2""*? ways. Probability: C(m + (i — 1), m)2"-*0/ 
2" = C(m + i — 1, m)2-"?, ; 


Exercise 26 


How many arrangements of 5 as, 5 Bs and 5 ys are there with at least one 8 
and at least one y between each successive pair of as? 


Answer: 


There are three cases: 

(a) Exactly one 6 and exactly one y between each pair of as: Between each 
of the four pairs of as, the B or the y can be first—2* ways. The fifth B 
and fifth y along with the sequence of the rest of the letters can be considered 
as 3 objects to be arranged—3! ways. Altogether, 2* x 3! = 96 ways. 


Exactly one 8 between each pair of as and two ys between some pair of 
as (or two Bs between some pair of as and exactly one y between each 
pair of as): There are 4 choices for between which pair of as the two ys 
go and 3 ways to arrange the two ys and one § there. There are 2? choices 
for whether the 6 or the y goes first between the other 3 pairs of as and 


(b 


— 


(c) 


Supplement: Selected Solutions to Problems in Chapter 5 239 


2 choices for at which end of the arrangement the fifth 8 goes. Multiplying 
by 2 for the case of two Bs between some pair of as, we obtain 2 X 
(4 X 3 X 23 X 2) = 384 ways. 


Two Bs between some pair of as and two ys between some pair of as: 
There are two subcases. If the two Bs and two ys are between the same 
pair of as, there are 4 choices for which pair of as, C(4, 2) ways to arrange 
them between this pair of as, and 23 choices for whether the B or the y 
goes first between the other 3 pairs of as. If two Bs and two ys are between 
the different pairs of as, there are 4 X 3 ways to pick between which as 
the two Bs and then between which as the two ys go, 3? ways to arrange 
the two ys and one 8 and to arrange the one y and two Bs, and 2? choices 
for whether the 6 or the y goes first between the other 2 pairs of as. 
Together, 4 x C(4, 2) x 23+ 4 x 3 x 3? x 2? = 1056 ways. 


Section 5.4 


Exercise 19 


If n distinct objects are distributed randomly into n distinct boxes, what is the 
probability that: 
(a) No box is empty? 


(b) 
(c) 


Exactly one box is empty? 
Exactly two boxes are empty? 


Answer: 
(a) n!/n". 


(b) 


(c) 


Pick which box is empty—n choices—then which other box gets two 
objects—n — 1 choices—then which two objects go into this box—C(n, 
2) choices—and then distribute the remaining n — 2 objects into the 
remaining n — 2 boxes, one per box—(n — 2)! ways. Probability: n x 
(n — 1) X C(n, 2) X (n — 2)!/n". 

Pick which two boxes are empty—C(n, 2) choices. Two cases: Case (i). 
One box has three objects: pick the box with three objects—n — 2 choices— 
then pick which three objects go in this box—C(n, 3) choices—and then 
distribute the remaining n — 3 objects into the remaining n — 3 boxes— 
(n — 3)! ways. Case (ii). Two boxes which each have two objects: pick 
the two boxes with two objects—C(n — 2, 2) choices—then pick which 
two objects go into the first 2-object box and which two objects go into 
the second 2-object box—C(n, 2) x C(n — 2, 2) choices—and then 
distribute the remaining n — 4 objects into the remaining n — 4 boxes— 
(n — 4)! ways. The probability is C(n, 2) X {[@ — 2) X C(n, 3) X (n - 
3)!] + C(n — 2, 2) X Cn, 2) X C(n — 2,2) X (A - 4)! }/n". 
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Exercise 20 


How many ways are there to distribute eight balls into six boxes with the first 
two boxes collectively having at most four balls if: 
(a) The balls are identical? 


(b) The balls are distinct? 


Answer: 
Break into 5 cases of 0 or 1 or 2 or 3 or 4 balls in first two boxes 


4 
(a) & CK +2-1,)XCB-k+4-1,8-4 
k=0 


4 
(b) >) C(8, k) X 2* x 48: If k distinct balls are in the first two boxes, pick 
k=0 


which k balls—C(8,k) ways—then decide for each ball into which of the 
first two boxes it goes—2‘ ways—and then distribute remaining 8 — k 
distinct balls into the other 4 boxes—4** ways. 


Exercise 38 


What fraction of all arrangements of WISCONSIN without any pair of consecu- 
tive vowels have W adjacent to an I? 


Answer: 


Arrangements of WISCONSIN without any pair of consecutive vowels: 
Cl(6 — 2) + 4 — 1, (6 — 2)] = CC7, 4) patterns of vowels and consonants 
without consecutive vowels; 3 ways to distribute I, 1, O among 3 vowel positions; 
6!/2!2! ways to distribute consonants among consonant positions. Denominator 
in fraction is then C(7, 4) X 3 X 6!/2!2! 

For the numerator, we look at three cases: 

(a) W is beside an I that is the first vowel. If W is just before the I (glue W 
and I together), there are C((5 — 2) + 4 — 1, (5 — 2)) = C(6, 3) patterns 
for distributing the other consonants to assure no consecutive vowels, 
whereas if W is just after I, there are C(5 — 1) + 4-— 1,( -— 1) = 
C(7, 4) patterns. In either case, there are 2 ways to order the other vowels 
and 5!/2!2! ways to arrange the other consonants. 


(b) If W is beside an I that is the last vowel, the number of outcomes is the 
same as in case (a). 


If W is beside an I that is the middle vowel, there are C((5 — 1) + 4 — 
1, (5 — 1)) = CQ, 4 patterns for distributing the other consonants to 
assure no consecutive vowels whether W is just before or just after the 
I—2 choices for W. Again the other vowels and other consonants can be 
placed in 2 X 5!/2!2!ways. We must subtract the cases with non-consecutive 
‘vowels’ IWI and O: C(5 — 1) + 3 — 1, (5 — 1)) X 2 X 51/2!2! In total, 
the numerator is [2 X C(6, 3) + 4 X C(7, 4) — C(6, 4] X 2 X 5!/2!2! 


(c 


— 
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Exercise 45 


How many subsets of six integers chosen (without repetition) from 1, 2,..., 
20 are there with no consecutive integers (e.g., if 5 is in the subset, then 4 and 
6 cannot be in it)? 


Answer: 


Form a binary sequence of length 20 with six 1s and fourteen Os to represent 
which integers are in the subset (a | in the ith position means that i is in the 
subset). In this form, we seek all 20-di git binary sequences with 6 nonconsecu- 
tive Is. C(14 — 5) + 7 — 1, (14 — 5)). 


Exercise 47 


How many arrangements are there of n Os and m 1s with & runs of Os? A run 
is a consecutive set (1 or more) of the same digit; e.g., 0001110100 has three 
(underlined) runs of Os. 


Answer: 


Represent a run no matter what its length as an R. Then we arrange m 1s and 
k Rs with no consecutive Rs. Using the reasoning in Exercise 45, there are 
C(m + 1, k) arrangements. Next pick how many Os there are in each run. We 
select distribute the n Os into k boxes (runs) with at least one 0 in each box — 
C(n — 1, k — 1) ways: C(m + lk) xX Cn —- 1,k - 1). 


Exercise 50 


How many ways are there to distribute 20 distinct flags onto 12 distinct flagpoles 

if: 

(a) In arranging flags on a flagpole, the order of flags from the ground up 
makes a difference? 


(b) No flagpole is empty and the order on each flagpole is counted? 


Answer: 


(a) Lay the flagpoles on the ground end-to-end with a slash (|) between flag- 
poles. Then we need to count all arrangements of the 20 distinct flags and 
(12 — 1) |s—31!/11! ways. 

(b) First we distribute 20 identical flags into 12 flagpoles with no flagpole 
empty in C(20 — 1, 12 — 1) ways (according to Example 4). Now as in 
part (a), lay the flagpoles end-to-end. Then arrange the 20 distinct flags 
among the 20 places where there are (identical) flags in 20! ways: 
c(20 — 1,12 — 1) x 20! 


CHAPTER 6 


GENERATING 
FUNCTIONS 


6.1 GENERATING FUNCTION MODELS 


In this chapter, we introduce the concept of a generating function. Generating 
functions are a convenient tool for handling special constraints in selection and 
arrangement problems with repetition. They are used in Chapters 7, 8, and 9 to 
solve other combinatorial problems. Generating functions are the most abstract 
problem-solving technique introduced in this text. But once understood, they 
are also the easiest way to model a broad spectrum of combinatorial problems. 

Suppose a, is the number of ways to select r objects in a certain procedure. 
Then g(x) is a generating function for a, if g(x) has the polynomial expansion 


g(x) = ant ax tar +-+> tax te-> + a,x 


If the function has an infinite number of terms, it is called a power series. 
In Section 5.5 we verified the well-known binomial expansion 


asapere (Mee (Berea (aree k(x 
1 2 r n 


Then g(x) = (1 + x)" is the generating function for a4, = C(n, r), the number 
of ways to select an r-subset from an n-set. Recall that we derived the expansion 
of (1 + x)" by first considering the formal multiplication of (a + xy: 
(a+ x)(a+x)(a + xX) = aaa + aax + axa + axx + xaa 
+ XaX + XXA 4+ XXX 


When a = 1, we had 


(1+x)d +x +x) = 111 + 11x 4+ 1x1 + lov t xt 
+ xix + xxl + xxx (1) 


Such a formal expansion lists all ways of multiplying a term in the first factor 
times a term in the second factor times a term in the third factor. The prob- 
lem of determining the coefficient of x" in (1 + x)’, and more generally in 
(1 + x)", reduces to the problem of counting the number of different formal 
products with exactly r xs and (n — r) Is. So the coefficient of x’ in (1 + xp 


is C(3, r), and in (1 + x)" is Cm, r). 
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It is very important that the multiplication of several polynomial factors 
be viewed as generating the collection of all formal products obtained by 
multiplying together a term from each polynomial factor. If the ith polynomial 
factor contains r; different terms and there are n factors, then there will be 
ry Xm Xr; X +++ X rv, different formal products. For example, there will 
be 2” formal products in the expansion of (1 + x)’. 

In the expansion of (1 + x + x’)*, the set of all formal products will be 
sequences of the form: 


XpryX pry Xora x (2) 


that is, a 1 or an x or an x? in each of the entries in the product, such as, x1xx. 

In this chapter we are primarily concerned with multiplying polynomial 
factors in which the powers of x in each factor have coefficient 1, factors such 
as(1 +x +x? + x?) or(1 + x? + x4 + x° + +--+). These factors are completely 
specified by the set of different exponents of the xs. Note that 1 = x°. Thus 
expansion (1) can be rewritten 


(9 + x!(e® + x9 + xl) = 2x x? + OO! + xq ly? + x %yly! 4 gly 
+ xxx! + xlyly® + xlylyl 


And the formal products in Eq. (2) can be written as 


x? x? x? x? 
x Bae oe oe xe or x*1x"2x%3x% Oses2 (3) 
x x? x x? 


The problem of determining the coefficient of x” when we multiply several 
polynomial factors together can be restated in terms of exponents. Consider 
the coefficient of x° in the expansion of (1 + x + x’)*. It is the number of 
different formal products, such as x2x°x*x', formed from Eq. (3) whose sum of 
exponents is 5. Determining this coefficient can be modeled as an integer- 
solution-to-an-equation problem (Example 5 of Section 5.4). The coefficient 
of x° in (1 + x + x’)* is the number of integer solutions to 


e+e, +e, +e,=5 0O=e=2 


where the variable e; represents the exponent of the ith term in the formal 
products formed from Eq. (3). 

According to the equivalent forms of selection-with-repetition discussed 
in Section 5.4, the preceding integer-solution-to-an-equation problem is equiva- 
lent to the problem of selecting five objects from a collection of four types, 
with at most two objects of each type. It is also equivalent to the problem of 
distributing five identical objects into four distinct boxes with at most two objects 
in each box. 
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More generally, the coefficient of x’ in (1 + x + x’)* will be the number 
of integer solutions to 


Qe teatete=r 0Ose¢es2 


or, equivalently, the number of ways of selecting r objects from four types 
with at most 2 of each type (or of distributing r identical objects into four 
boxes with at most 2 objects in any box). Thus (1 + x + x?) is the generating 
function for a,, the number of ways to select r objects from four types with at 
most 2 of each type (or to perform the equivalent distribution). 

At this stage, we are concerned only with how to build generating function 
models for counting problems. In the next section we will see how various 
algebraic manipulations of generating functions permit us to evaluate desired 
coefficients. 

We have shown how the coefficients of (1 + x + x?)* can be interpreted 
as the solutions to a certain selection-with-repetition or distribution-of-identical- 
objects problem. This line of reasoning can be reversed: Given a certain selec- 
tion-with-repetition or distribution problem, we can build a generating function 
whose coefficients are the answers to this problem. We now give some exam- 
ples of how to build such generating functions. We use the intermediate model 
of an integer-solution-to-an-equation to aid in the construction of generating 
function models. 


Example 1 


Find the generating function for a,, the number of ways to select r balls from 
a pile of three green, three white, three blue, and three gold balls. 

This selection problem can be modeled as the number of integer solu- 
tions to 


ej tate te=r 0Ose=3 


Here e, represents the number of green balls chosen, e, the number of white, 
e; blue, and e, gold. We want to construct a product of polynomial factors such 
that when multiplied out formally (as described above), we obtain all products 
of the form x*1x%x*x%, with each exponent e; between 0 and 3. Then we need 
four factors, and each factor should consist of an ‘‘inventory’’ of the powers 
of x from which x* is chosen. That is, each factor should be (x° + x! + x? + 
x°). The desired generating function is thus (x° + x! + x? + 3)for(l +x+ 
P+ x3) 0 


Example 2 


Use a generating function to model the problem of counting all selections of 
six objects chosen from three types of objects with repetition of up to four 
objects of each type. Also model the problem with unlimited repetition. 
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This selection problem can be modeled as the number of integer solu- 
tions to 


€} +e, +e; = 6 0O=¢e=4 


This problem does not ask for the general solution of the ways to select r 
objects. However, in building a generating function, we automatically model 
all values of r, not just r = 6. Wanting a solution to the problem for six ob- 
jects means that we are interested only in the coefficient of x°, that is, the 
ways x°1x%x% can equal x°. We build a generating function with a factor of 
(1 +x + x? + x° + x‘) for each x“. The desired generating function is (1 + 
x +x? + x3 + x‘), and we want the coefficient of x° in it. 

Permitting unlimited repetition means that any number can be chosen of 
each type and so any exponent value is possible (although in this particular 
problem no exponent can exceed 6). From this point of view, the answer is 
the coefficient of x° in (1 + x + x? + x3 + +++}, where ‘‘+ +--+’? means 
that the factor is an infinite series. = 


In the unlimited repetition case above, we could have used the generating 
function (1 + x + x? + x3 + x4+ x5 + x°)', since we wanted only the coefficient 
of x° and thus greater powers of x could not be used. However, we shall see 
in the next section that it is easier to use infinite series in generating functions. 
In selection-with-repetition problems, when we ask for six objects from three 
types with unlimited repetition, we do not add the constraint of at most 6 of 
any type—it is implicit in the problem and there is no need to make it explicit. 
The same situation applies with generating functions. 


Example 3 


Find a generating function for a,, the number of ways to distribute r identical 
objects into five distinct boxes with an even number of objects not exceeding 
10 in the first two boxes and between three and five in the other boxes. 

While the constraints may be a bit contrived in this example, it is still 
easy to model the problem as an integer-solution-to-an-equation problem with 
appropriate constraints. We want to count all integer solutions to 


e:t+e+etete=r €1, €, even, O0Se,,e = 10, 


3. ey, €4; 5 S 5 


To generate all formal saa the form x‘1x°x%x%x*s, with the given 
constraints on the es, we need a’product of five factors, each containing the 
inventory of the powers of x permitted for its x*. For example, the inventory 
for x* is (1 + x? + x4 + x® + x® + x). The required generating function is 
g(x) = (CL +x? + axt + xo + x8 + xl)? 3 + x4 + 25), 


With a little practice, generating function models become simple to build. 
However, the concept behind generating functions is far from simple. In the 
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previous chapter, we solved similar selection and distribution problems with 
explicit formulas involving binomial coefficients. Now we are modeling these 
problems as some coefficient, which represents the number of formal products 
in the multiplication of certain polynomial factors. All we write down is the 
polynomial factors. 

A generating function cannot be designed to model a selection or distribu- 
tion problem for just one amount, say, six objects. It must model the problem 
for all possible numbers of objects. A generating function model stores all the 
necessary information about these subproblems in one function. 

In the next section, determining the value of a specified coefficient of such 
a function will be reduced to a series of rote algebraic operations. The fact 
that the function models a complex selection or distribution problem will be 
irrelevant. The combinatorial reasoning arises solely in the construction of 
generating functions. 


6.1 EXERCISES 


Mb Summary of Exercises The first 21 exercises involve simple 


generating function modeling. The remaining problems involve more challeng- 
ing modeling; Exercises 25-29 use multinomial generating functions. 


1. For each of the following expressions, list the set of all formal products 
in which the exponents sum to 4. 


(a) 0d +x4+ x71 + xy 

(b) 1 +x +x? 4+ x2 4+ x49 
() 1 +x274+ x47 +x 4+ xP 
(a) (lt et eae ep 


2. Build a generating function for a,, the number of integer solutions to the 
equations: 


(a) e, ta t+tetetes=r, 0Oses4 
(b) e, +e. +e, = 4, 0<e<4 


(c) ey te, +e; +e, =r, 2=e=8 e, even, e, odd 
(d)epteatete=r, 0=e 
(e) ey ae €y +e, 4+ Cp = 7; Oo< (a eo, e, odd, a= 3 


3. Build a generating function for a,, the number of r selections from a pile 
of: 


(a) Three red, four black, and four white balls. 


(b) Five jelly beans, three licorice sticks, eight lollipops with at least one 
of each candy. : 
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10. 


11. 


12. 


13. 


14, 


(c) Unlimited amounts of pennies, nickels, dimes, and quarters. 
(d) Seven types of lightbulbs with an odd number of the first and second 
types. 


. Build a generating function for a,, the number of distributions of r identical 


objects into: 

(a) Five different boxes with at most four objects in each box. 

(b) Four different boxes with between three and eight objects in each box. 
(c) Seven different boxes with at least one object in each box. 

(d) Three different boxes with at most five objects in the first box. 


. Use a generating function for modeling the number of 5-combinations of 


the letters M, A, T, H in which M and A can appear any number of times 
but T and H appear at most once. Which coefficient in this generating 
function do we want? 


. Use a generating function for modeling the number of different selections 


of r hot dogs when there are five types of hot dogs. 


. Use a generating function for modeling the number of distributions of 18 


chocolate bunny rabbits into four Easter baskets with at least 3 rabbits in 

each basket. Which coefficient do we want? 

(a) Use a generating function for modeling the number of different election 
outcomes in an election for class president if 27 students are voting 
among four candidates. Which coefficient do we want? 

(b) Suppose each student who is a candidate votes for herself or himself. 
Now what is the generating function and the required coefficient? 


(c) Suppose no candidate receives a majority of the vote. Repeat part (a). 


. Find a generating function for a,, the number of r-combinations of an n- 


set with repetition. 


Given one each of u types of candy, two each of uv types of candy, and 
three each of w types of candy, find a generating function for the number 
of ways to select r candies. 

Find a generating function for a,, the number of k-combinations of n types 
of objects with an even number of the first type, an odd number of the 
second type, and any amount of the other types. 

Find a generating function for a,, the number of ways to distribute r 
identical objects into g distifict boxes with an odd number between r, and 
5, in the first box, an even number between r, and s, in the second box, 
and at most three in the other boxes. 

Find a generating function for a,, the number of ways n distinct dice can 
show a sum of r. 

Find a generating function for a,, the number of ways a roll of six distinct 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24, 


25. 


6.1 Generating Function Models 249 


dice can show a sum of r if: 
(a) The first three dice are odd and the second three even. 
(b) The ith die does not show a value of i. 


Build a generating function for a,, the number of integer solutions to e, + 
&+tete=r,-35¢e53., 


Find a generating function for the number of integers between 0 and 999,999 
whose sum of digits is r. 


Find a generating function for the number of selections of r sticks of 
chewing gum chosen from eight flavors if each flavor comes in packets of 
five sticks. 


(a) Use a generating function for modeling the number of distributions of 
20 identical balls into five distinct boxes if each box has between 2 
and 7 balls. 


(b) Factor out an x’ from each polynomial factor in part (a). Interpret this 
revised generating function combinatorially. 


Use a generating function for modeling the number of ways to select five 
integers from 1, 2,..., m, no two of which are consecutive. Which coeffi- 
cient do we want for n = 20? For a general n? 


Explain why (1 + x + x? + +++ + x’) is not a proper generating function 
for a,, the number of ways to select r objects from four types with repetition. 
What is the correct generating function? 


Explain why (1 + x + x? + x’ + x4)’ is not a proper generating function 
for the number of ways to distribute r jelly beans among r children with 
no child getting more than four jelly beans. 


Show that the generating function for the number of integer solutions to 
aetetrteate=rO0f=e Sens e,5 &, is 


GI+xtx2+--)d te txtt--sy 

(+x +x8t-- Jd txtt x8 +--+) 
Find a generating function for the number of ways to make r cents change 
in pennies, nickels, and dimes. 


A national singing contest has 5 distinct entrants from each state. Use a 
generating function for modeling the number of ways to pick 20 semifinal- 
ists if: 

(a) There is at most 1 person from each state. 

(b) There are at most 3 people from each state. 

Find a generating function g(x, y) whose coefficient of x'y* is the number 


of ways to distribute r chocolate bars and s lollipops among five children 
such that no child gets more than three lollipops. 
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26. 


27. 


28. 


29 


(a) Find a generating function g(x, y, z) whose coefficient x"y°z' is the 
number of ways to distribute r red balls, s blue balls, and ¢ green balls 
to n people with between three and six balls of each type to each 
person. 

(b) Suppose also that each person gets at least as many red balls as blues. 

(c) Suppose also that the first three people get equal numbers of reds and 
blues. 


(d) Suppose also that no one gets the same number of green and red balls. 
Find a generating function g(x, y, z) whose coefficient of x"y*z' is the number 


of ways eight people can each pick two different fruits from a bowl of 
apples, oranges, and bananas for a total of r apples, s oranges, and ¢ bananas. 


Find a generating function (X,, %, ..., %m) Whose coefficient of x}ix?... 
x'm is the number of ways 7 people can pick a total of r; chairs of type 1, 
r, chairs of type 2,..., 7, chairs of type m if: 


(a) Each person picks one chair. 

(b) Each person picks either two chairs of one type or no chairs at all. 
(c) Person i picks up to i chairs of exactly one type. 

If g(%, %2,.--,%)) = (1 + + +++ + x,)", p > n, how many terms of 


g(X,,...,%,)’s expansion have no exponent of any x; greater than 1? What 
is the coefficient of one of these terms? 


6.2 CALCULATING COEFFICIENTS 
OF GENERATING FUNCTIONS 


We now develop algebraic techniques for calculating the coefficients of generat- 
ing functions. All these methods seek to reduce a given generating function 
to a simple binomial-type generating function or a product of binomial-type 
generating functions. For easy reference, we list first all the polynomial identities 
and expansions to be used in this section. They will be explained below. 


Polynomial Expansions 


(1) 


(2) 


(3) 


+1 
meh ee er eee he 


1 — x” 
1- 


1 


1- 


atoratt (Mes (Meret (Meret (Mx 


a Ue a ae aes Sa 
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(4) -x"y"=1- (")e + (3) pened ay(7) + 
+ ay()em 
n 


ae oe Copa. aes se ae ; 
Gina 1+ 1 x 7 xt + x 


(6) If A(x) = f(x)g(x), where f(x) = ay + ayx + ayx? + +++ and g(x) = 
by + Dix + box? +--, 


then 


h(x) = agby + (aiby + agb,)x + (aby + ayb, + and)? +++ 
+ (a,bo + a,—\0, + a,b, a Ayb,)x" tees 


The rule for multiplication of generating functions in Eq. (6) is simply the 
standard formula for polynomial multiplication. Identity (1) can be verified by 
polynomial ‘‘long division.’’ We restate it, multiplying both sides of Eq. (1) 
by (1 — x), asl —x™) = (1 — x) +x +x? +--+ + x"). We verify 
that the product of the right-hand side is 1 — x”*! by ‘‘long multiplication.’ 


Let ee bes ee" 


l-x 

Lex att oe a (*) 

—x— x2 — x3 eH yma tl 

1 = xt 
If m is made infinitely large, so that 1 + x + x? + +++ + x” becomes the 
infinite series 1 + x + x? +--+, then the multiplication process (*) will yield 
a power series in which the coefficient of each x‘, k > 0, is zero [the reader 
can confirm this in (*)]. We conclude that (1 — x1 +x +x?74+ +--+) =1. 


[Numerically, this equation is valid for |x| < 1; the ‘‘remainder’’ term x”*! in 
(*) goes to 0 as m becomes infinite.] Dividing both sides of this equation by 
(1 — x) yields identity (2). 

Expansion (3), the binomial expansion, was explained at the start of Section 
6.1. Expansion (4) is obtained from (3) by expanding (1 + y)", where y = 
—x is . 


[1+ (-x")"=1+ (")- 7 (s)cey 


Se Geedl Se (")-ey feed (")(—xey 
k n 


Expansion (4) follows immediately. 
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By identity (2), (1 — x)”", or equivalently, (.) , equals 


GQtxtxr>tx3t-+-) (7) 


Let us determine the coefficient of x’ in Eq. (7) by counting the number of 
formal products whose sum of exponents is r. If e; represents the exponent of 
the ith term in a formal product, then the number of formal products 
x"x2x%,. x whose exponents sum to r is the same as the number of integer 
solutions to the equation 


e+et+teat-:-+e=r e; = 0 


In Example 5 of Section 5.4, we showed that the number of nonnegative 
integer solutions to this equation is C(r + n — 1, r). Thus the coefficient of 
x" in Eq. (7) is C(r + n — 1, r). This verifies expansion (5). 

With formulas (1) to (6) we can determine the coefficients of a variety of 
generating functions: First, perform algebraic manipulations to reduce a given 
generating function to one of the forms (1 + x)", (1 — x”)", or (1 — x)", or 
a product of two such expansions; then use expansions (3) to (5) and the product 
rule (6) to obtain any desired coefficient. We illustrate some common reduction 
methods in the following examples. 


Example 1 


Find the coefficient of x'® in (x? + x3 + x4 + ++ -+)5, What is the coefficient 
of x"? 

To simplify the expression, we extract x? from each polynomial factor and 
then apply identity (2). 

(P+ xP txt tes P= [rd txt x2+-+-yp 

a an 
(lay. 
Thus the coefficient of x'® in (x? + x3 + x4 + ---)5 is the coefficient of x' 
in x'"(_ — x)~*. But the coefficient of x'® in this latter expression will be the 
coefficient of x° in (1 — x)~* [i.e., the x° term in (1 — x)~> is multiplied by x! 
to become the x'® term in x'°(1 — x)~*]. From expansion (5), we see that the 
coefficient of x® in (1 — x) is C(6 + 5 — 1, 6). 

More generally, the coefficient of x" in x'°(1 — x)~> equals the coefficient 
of x" in (1 — x), namely, C((r — 10) + 5 — 1, (r — 10)). = 


H=x1Ltxtx27 +--+ - p= xl 


Observe that (x? + x3 + x4 + +++ )° is the generating function for the 
number of ways to select r objects with repetition from five types with at least 
2 of each type. In the last chapter, we solved such a problem by first picking 
two objects in each type—1 way—and then counting the ways to select the 
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remaining r — 10 objects—C((r — 10) + 5 — 1, (” — 10)) ways. In the 
generating function analysis in Example 1, we algebraically picked out an x? 
from each factor for a total of x!° and then found the coefficient of x"! in 
(1 +x + x? + +++), the generating function for selection with unrestricted 
repetition of r from five types. 

The standard algebraic technique of extracting the highest common power 
of x from each factor corresponds to the ‘‘trick’’ used to solve the associated 
selection problem. Such correspondences are a major reason for using generating 
functions: the algebraic techniques automatically do the combinatorial reasoning 
for us. 


Example 2 


Use generating functions to find the number of ways to collect $15 from 20 
distinct people if each of the first 19 people can give a dollar (or nothing) and 
the twentieth person can give either $1 or $5 (or nothing). 

The generating function for the number of ways to collect r dollars from 
these people is (1 + x)!(1 + x + x5), We want the coefficient of x >. The first 
part of this generating function has the binomial expansion 


(l+x)?=1+ (V)e+ (P)e+ teed (2) veep ({3)= 


If we let f(x) be this first polynomial and let g(x) = 1 +x + x°, then we can 
use Eq. (6) to calculate the coefficient of x" in h(x) = f(x)g(x). Let a, be the 
coefficient of x’ in f(x) and b, the coefficient of x’ in g(x) . We know that 


a, = and that by = b,; = bs = 1 (other 5s are Zero). 
Then the coefficient of x5 in A(x) is 
Qisbp + aid) + ayzbp +o + Ad 5 
which reduces to 
413d) + ab, + ayb; 


since bo, b,, bs are the only nonzero coefficients in 8 (x). Substituting the values 
of the various as and bs in Eq. (8), we have 


Us) 1+ (14) 4+ (to) 1=(19)+(2) +(32) 


Example 3 


How many ways are there to distribute 25 identical balls into seven distinct 
boxes if the first box can have no more than 10 balls but any amount can go 
into each of the other six boxes? 
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The generating function for the number of ways to distribute r balls into 
seven boxes with at most 10 balls in the first box is 


(he ee ee ee a ee ey 


Ref oer oe ee 1 \' 
(V4) = (4) 


using identities (1) and (2). We want the coefficient of x”. Let f(x) = 1 — x"! 
and g(x) = (1 — x)’. Using expansion (5), we have 


gepaa-atait (TPT ee (242 oN 


ed eee ae 
7: 


We want the coefficient of x* (25 balls distributed) in h(x) = f(x)g(x). 
As in Example 2, we need to consider only the terms in the product of the two 


7 
polynomials (1 — x!') and (4) that yield an x* term. The only nonzero 


coefficients in f(x) = (1 — x") are ay = 1 and a,, = —1. So the coefficient 
of x in f(x)g(x) is: 
Agbrs + ay bi, 
25+7-1 1444+7-1 
Sx +(-1)xX 
ix 25 )reox{ a 


The combinatorial interpretation of the answer in Example 3 is that we 
count all the ways to distribute (without restriction) the 25 balls into the seven 
boxes, C(25 + 7 — 1, 25) ways and then subtract the distributions that violate 
the first box constraint, that is, distributions with at least 11 balls in the first 
box, C((25 — 11) + 7 — 1, (25 — 11)) (first put 11 balls in the first box and 
then distribute the remaining balls arbitrarily). Again, generating functions 
automatically performed this combinatorial reasoning. = 


The next example employs all the techniques used in the first three examples 
to solve a problem that cannot be solved by the combinatorial methods of the 
previous chapter. 


Example 4 


How many ways are there to select 25 toys from seven types of toys with 
between two and six of each type? 
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The generating function for a,, the number of ways to select r toys from 
seven types with between 2 and 6 of each type, is 


Cre ak te ey 
We want the coefficient of x™. As in Example 1, we extract x* from each factor 
to get 
[PU tax tx? t xP + xSP = x1 tix tx? + x3 + x4) 
Now we reduce our problem to finding the coefficient of x" = x" in (1 + 


x +x? + x3 + x‘). Using identity (1), we can rewrite this generating function 
as 


ZaoN 7 
(txtetee ty (2) == x9 : 


—x 1-x 


Let f(x) = (1 — x*)’ and g(x) = (1 — x). By expansions (4) and (5), 
respectively, we have 


fey na—xy=i— (Tare (Jem (ere 
ey re es oe 1+7-1 2+7-1 =P. 
eo= (7) =1+( )r+( : Jit 


nee oe 
r 


To find the coefficient of x'!, we need to consider only the terms in the 
r 
product of the two polynomials (1 — x°*)’ and (4) that yield x''. The only 


nonzero coefficients in f(x) = (1 — x)’ witha subscript = 11 (larger subscripts 
can be ignored) are dp, as, and a4 [see the expansion of f(x) above]. The 
products involving these three coefficients that yield x!' terms are: 


apd, + asbs + AyD 


11+7-1 7 6+7-1 a) 1+7-1 
To Waa )+(-()) x( 6 )+(2)( ). 
The following combinatorial interpretation can be given to the final answer 
in Example 4. The first term, C(11 + 7 — 1, 11), counts the number of ways 
to select 11 toys from seven types of toys with no restriction, where 11 is the 
number of additional toys to select after we first pick 2 toys of each type. The 
next term, —7C(6 + 7 — 1, 6), subtracts seven cases. of a violation where we 


Pick at least 5 additional toys of some type (we pick 5 of some type and then 
pick 11 — 5 = 6 more toys from the 7 types). The final term, C(7,2)9C0 + 
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7 — 1, 1) adds back on all C(7,2) cases where some pair of violations occurred, 
that is, with at least 5 chosen from a pair of types [we pick 5 of two types and 
then pick 11 — (2 X 5) = 1 more toy from the 7 types]. The logic behind this 
combinatorial approach will be developed in Chapter 8. 

We close this section by showing how generating functions can be used 
to verify binomial identities. We express the right side of the identity as 
a particular coefficient in some generating function h(x) and the left side as 
the same coefficient in a product of generating functions f(x) and g(x), where 


A(x) = f(x)gQ). 


Example 5 
Verify the binomial identity 


2 2 2 
i) eee 
0 1 n n 

The right-hand side of the identity is the coefficient of x” in (1 + x)". The 
left-hand side terms involves coefficients in (1 + x)". The product of generating 
functions we want is (1 + x)"(1 + x)”. That is, let f(x) = g(x) = (1 + x)" so 
that f(x)g(x) = h(x) = (1 + x)”. Then a, = b, = C(n, r). By the product rule 
(6), the coefficient of x” in f(x)g(x) is 


agb, + ayb,- SS ele a,Do 


(NN OO) 


Equating coefficients of x” in (1 + x)"(1 + x)” = (1 + x)", we obtain the 
required identity. = 


6.2 EXERCISES 


Mb Summary of Exercises The first 25 exercises are similar to 


the examples of coefficient calculation in this section. Exercises 27—32 involve 
binomial identities (several of these problems are quite tricky). Exercises 34—38 
and 40 introduce the topic of probability generating functions (some background 
in probability is helpful). 

1. Find the coefficient of x8 in(1 +xt+x74+°24+---)% 

2. Find the coefficient of x7 in (> + x® + x7 +---) 

3. Find the coefficient of x? in (1 + x? + x4\(1 + x)”. 


12. 
13. 


14. 


15. 


16. 
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. Find the coefficient of x” in ( + x2 + x3 + x4 4 XY? + xP + yt + 


F “p 


. Find the coefficient of x'® in (x? + x3 + x4 + x3 + yO 4 xy 
. Find the coefficient of x in Q!9 + x! +--+. 4 XY + xr freee yh 


x(x 0 a 2 x*), 


. Find the coefficient of x? in: 


x+3 
1—2x+ x? 
x? — 3x b™™ 
(di — x! © G— bxy™! 
(h=x'y 
(1 — x) 


(a) x1 — x)" (d) 


(b) 


(c) 


. Give a formula similar to Eq. (1) for: 


(a) Lt+xt + x8 +--+ 4+ 4% (b) x? + x + +++ + x 180 
Find the coefficient of x° in (x? + x3 + x4 4+ xy, 

Find the coefficient of x" in (1 + x3 + x5 +49 +... y. 

Find the coefficient of x! in: 

(a) (1 — x) (d) (1 — 4x) 

(b) (1 + x)! (e) (1 + x*)~4 

(ce) 1 +x)° 


Find the coefficient of x in (1 + x3 + x8)!9, 


Use generating functions to find the number of ways to select 10 balls 
from a large pile of red, white, and blue balls if: 


(a) The selection has at least two balls of each color. 

(b) The selection has at most two red balls. 

(c) The selection has an even number of blue balls. 

Use generating functions to find the number of ways to distribute r jelly 
beans among eight children if: 

(a) Each child gets at least one jelly bean. 

(b) Each child gets an even number of beans. 

How many ways are there to place an order for 12 chocolate sundaes if 
there are 5 types of sundaes, and at most 4 sundaes of one type are allowed? 


How many ways are there to paint the 10 identical rooms in a hotel with 
five colors if at most three rooms can be painted green, at most three 
painted blue, at most three red, and no constraint on the other two colors, 
black and white? 
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17. 


How many ways are there to distribute 20 cents to n children and one 
parent if the parent receives either a nickel or a dime and: 


(a) The children receive any amounts? 
(b) Each child receives at most 1¢? 


. How many ways are there to get a sum of 25 when 10 distinct dice are 


rolled? 


. How many ways are there to select 300 chocolate candies from seven types 


if each type comes in boxes of 20 and if at least one but not more than 
five boxes of each type are chosen? (Hint: Solve in terms of boxes of 
chocolate.) 


. How many different committees of 40 Senators can be formed if the two 


Senators from the same state (50 states in all) are considered identical? 


. How many ways are there to split 6 copies of one book, 7 copies of a 


second book, and 11 copies of a third book between two teachers if each 
teacher gets 12 books and each teacher gets at least 2 copies of each book? 


. How many ways are there to divide five pears, five apples, five doughnuts, 


five lollipops, five chocolate cats, and five candy rocks into two (unordered) 
piles of 15 objects each? 


. How many ways are there to collect $24 from 4 children and 6 adults if 


each person gives at least $1, but each child can give at most $4 and each 
adult at most $7? 


. Ifacoin is flipped 25 times with eight tails occurring, what is the probability 


that no run of six (or more) consecutive heads occurs? 


. If 10 steaks and 15 lobsters are distributed among four people, how many 


ways are there to give each person at most 5 steaks and at most 5 lobsters? 


. Show that (1 — x — x? — x? — x+ — x° — x°)”! is the generating function 
g & 


for the number of ways a sum of r can occur if a die is rolled any number 
of times. 


. Use generating functions to show that 


> 


. We i ) = (" : " 


. Use the equation 


(1— x)" _ ‘ 
(=x (+x) 


to show that 


m/2 = = 
rena (4 | ae “a =) m =nandmeven 
=0 


. 29, 


30. 


31. 


32. 


33. 
34. 


35. 


36. 


37. 


38. 
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Use binomial expansions to evaluate: 


(a) alee (b) Tene) (c) S20) 


(a) Evaluate S (‘ : ) (b) Evaluate s ie 7 i): 
k 


=n k=0 \M — k 


co k _ od k 
(a) Show that » (3) C ae 7 = 2". (b) Evaluate S (3) k. 
f=0 \2 k =o \2 


Why cannot one set x = —1 in formula (5) to ‘‘prove’’ that 


I\i_ So fnt+k-1 
Gray ae ) 


If g(x) is the generating function for a,, then show that g®(0)/k! = a,. 


A probability generating function px(t) for a discrete random variable X 
has a polynomial expansion in which P,, the coefficient of t’, is equal to 
the probability that X = r. 

(a) If X is the number of heads that occur when a fair coin is flipped n 
times, show that py(t) = ()"(1 + 1)". 

(b) If X is the number of heads that occur when a biased coin is flipped 
n times with probability p of heads (and q = 1 — p), show that 
Px(t) = (q + pt)”. 

(c) If X is the number of times a fair coin is flipped until the fifth head 
occurs, find p,(t). 


(d) Repeat part (c) until the mth head occurs and probability of a head is 
D. 

(a) The expected value E(X) of a discrete random variable X is defined to 
be Sp,r. Show that E(X) = px(1), that is, é Px(t), with t set equal to 


1. 
(b) Find E(X) for the random variables X in Exercise 34. 
(a) The second moment E,(X) of a discrete random variable X is defined 
to be Xp,r?. Show that E(X) = px(1) + p%1). 
(b) Find £,(X) for the random variables X in Exercises 34(b) and 34(c). 
Suppose a fair coin is flipped until the mth head occurs and suppose that 
no more than s tails in a row occur. If X is the number of flips, find P,(t). 


Experiments A’ and A” have probabilities p’ and p” of success in each trial 
and are performed n’ and n” times, Tespectively, Let X’ and X” be the. 
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number of successes in the respective experiments, and let X be the total 
number of successes on both experiments. Verify the following. 


(a) px(t) = px'(Dpx"O 
(b) E(X) = E(X') + EX") 
(c) E,(X) = E{X') + EX") + E(X’X") 


39. Suppose a red die is rolled once and then a green die is rolled as many 
times as the value on the red die. If a, is the number of ways that the 
(variable length) sequence of rolls of the green die can sum to r, show that 
the generating function for a, is f(f(x)), where f(x) = (x + e+ xt 
xt + x9 + x), 

40. Suppose X is the random variable of the number of minutes it takes to 
serve a person at a fast-food stand. Suppose Y is the random variable of 
the number of people who line up to be served at the stand in one minute. 
Let Z be the number of people who line up while a person is being served. 
Show that p2(t) = px(pr(4). 


6.3 PARTITIONS 


In this section we discuss partitions and their generating functions. Unfortu- 
nately, there is no easy way to calculate the coefficients of most of these 
generating functions. A partition of a group of r identical objects divides the 
group into a collection of (unordered) subsets of various sizes. Analogously, 
we define a partition of the integer r to be a collection of positive integers 
whose sum is r. Normally we write this collection as a sum and list the integers 
of the partition in increasing order. For example, the seven partitions of the 
integer 5 are 


1+1+1+1+1 14+2+2 23 5 
1+14+1+2 1+1+3 1+4 


Note that 5 is a ‘“‘trivial’’ partition of itself. 

Let us construct a generating function for a,, the number of partitions of 
the integer r. A partition of an integer is described by specifying how many 
1s, how many 2s, and so on, are in the sum. Let e denote the number of ks 
ina partition. Then 

le, + 2e) + 3e3 +--+ the toc+ tre, =r 
Intuitively, we can think of picking r objects from an unlimited number of 
piles where the first pile contains single objects, the second pile contains objects 


stuck together in pairs, the third pile contains objects stuck together in triples, 
and so on. To model this integer-solution-to-an-equation problem with a generat- 
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ing function, we need polynomial factors whose formal multiplication yields 
products of the form 


x (x7)? (x7) (x*/? 
x! (x7)! (x3)! (x)! 


x2 (x?) Oy cee (x*? 


Thus the generating function g(x) must be 
BHU txt txeters tutt---) 
SCL eet et eX Pte = Set peas) 
1th txSt x2 textes) 


oO a le sls ea ae ea | 


If we set y = x’, then the second factor in g(x) becomes 1 + ytyt--- 
= (1 — y)!. Thus 
Ptr txttxStee tame --- = - x) 
Similarly, the kth factor can be written as (1 — x)". For partitions up to 


r = m, we need the first m polynomial factors. But for arbitrary values of r, 
we need an infinite number of polynomial factors. Then 


we 1 
i Gs pd dl Se 


We now consider some more specialized partition problems. 


Example 1 


Find the generating function for a,, the number of ways to express 7 aS a sum 
of distinct integers. 

We must constrain the standard partition problem not to allow any repetition 
of an integer. For example, there are three ways to write 5 as a sum of distinct 
integers 2 + 3, 1 + 4, and 5. The appropriate modification of the generating 
function for unrestricted partitions is 


B(x) = 1 + 1 + 2°) + 8) txt) txt) 


Example 2 


Find a generating function for a,, the number of ways that we can choose 2¢, 
3¢, and 5¢ stamps adding to a net value of r¢. 
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The problem is equivalent to the number of integer solutions to 
2e, + 3e, + Ses =r 0 = e, &3, @5 
The appropriate generating function is 
(Qtxrertxttxo te) te tro +x? +- °°) 
“dt xe txt xh tess) a 


Example 3 


Show with generating functions that every positive integer can be written as a 
unique sum of distinct powers of 2. 

The generating function for a,, the number of ways to write an integer r 
as a sum of distinct powers of 2, will be similar to the generating function for 
sums of distinct integers in Example 1, except that now only integers that are 
powers of 2 are used. The generating function is 


w= tod te tx) tx)---dtx)ee: 


To show that every integer can be written as a unique sum of distinct 
powers of 2, we must show that the coefficient of every power of x in g*(x) 
is 1. That is, show that 

1 
1-x 


gt(yeltxtvetexe ter = 


or equivalently 
(1 — x)g*(x) = 1 
ie prove: this identity by repeatedly using the factorization (1 — x‘) + x*) 
=1-—x*, 
Gd —-xdg*t@) =U -xnd +00 +) + x40 + Eon eee 
=[ d-2x) Jd+2x0 + x41 + 2°)... 
=[ (1 — x‘) Jd + x4)(1 + x8)... 


=] 


By successively making the replacement (1 — x‘) + x) = 1 — x*%, we 
eventually eliminate all factors in (1 — x)g*(x). Formally, the coefficient of 
any specific x‘ in (1 — x)g*() must be 0. So (1 — x)g*(x) = 1 and g*(x) = 
lt+xt+x+--+-, as required. = 


A convenient tool for studying partitions is a diagram known as Ferrers 
diagram. A Ferrers diagram displays a partition of r dots in a set of rows 
listed in order of decreasing size. The partition 1 + 2 + 2 + 3 + 7 of 15 is 
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Figure 6.1 (a) (b) 


shown in the Ferrers diagram in Figure 6.la. If we transpose the rows and 
columns of a Ferrers diagram of a partition of r, we get a Ferrers diagram of 
another partition of r. This diagram is called the conjugate of the original 
Ferrers diagram. For example, Figure 6.15 shows the conjugate of the Ferrers 
diagram in Figure 6.la. This new Ferrers diagram represents the partition of 
I5,1+1+14+1+24+445, Clearly, transposing is unique: Two Ferrers 
diagrams have equal conjugates if and only if they are equal. 


Example 4 


Show that the number of partitions of an integer r as a sum of m positive 
integers is equal to the number of partitions of r as a sum of positive integers, 
the largest of which is m. 

If we draw a Ferrers diagram of a partition of r into m parts, then the 
Ferrers diagram will have m rows. The transposition of such a diagram will 
have m columns, that is, the largest row will have m dots. Thus there is a one- 
to-one correspondence between these two classes of partitions. m= 


6.3 EXERCISES 


Summary of Exercises Exercises 1-1] involve generating 
function models for partitions. The next six exercises use Ferrers diagrams to 
verify partition identities. The next five exercises are more difficult partition 
problems. 


1. List all partitions of the integer: (a) 4, (b) 6. 
2. Find a generating function for a,, the number of partitions of r into: 
(a) Even integers. (b) Distinct odd integers. 


3. Find a generating function for the number of ways to write the integer r 
as a sum of positive integers in which no integer appears more than three 
times. 
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4. 


10. 


11. 


12. 


13. 


14. 


15. 


Find a generating function for the number of integer solutions of 2x + 
3y + 7z = r with: 
(a) x,y,z 20 (b) OS 272825 y282£x 


. Find a generating function for the number of ways to make r cents change 


in pennies, nickels, dimes, and quarters. 


. Find a generating function for the number of ways to distribute r identical 


objects into: 
(a) Three indistinguishable boxes. 
(b) 7 indistinguishable boxes (n = r). 


. (a) Show that the number of partitions of 10 into distinct parts (integers) 


is equal to the number of partitions of 10 into odd parts by listing all 
partitions of these two types. 


(b) Show algebraically that the generating function for partitions of r into 
distinct parts equals the generating function for partitions of r into odd 
parts, and hence the numbers of these two types of partitions are equal. 


Show with generating functions that every positive integer has exactly 
one partition into distinct powers of 10; that is, it has a unique decimal 
representation. 


(a) Prove the result in Example 3 by induction. 
(b) Prove the result of Example 3 directly by recursively substituting 
d+ 2x45 = (1 — x)/d — x*) in g*(). 


The equation in Example 3, g*(x)(1 — x) = 1, can be rewritten 1 — x = 
1/g*(x). Use this latter equation to prove that among all partitions of an 
integer r, r = 2, into powers of 2, there are as many such partitions with 
an odd number of parts as with an even number of parts. 


Let R(r, k) denote the number of partitions of the integer r into k parts. 
(a) Show that R(r, kK) = Rr -— 1,k — 1) + RO — kk). 

(b) Show that >4.,R(7 — r, k) = R(n, r). 

Use a Ferrers diagram to show that the number of partitions of an integer 


into parts of even size is equal to the number of partitions into parts such 
that each part occurs an even number of times. 


Interpret the integer multiplication mn, ‘‘m times n,’’ to be the sum of m 
ns. Prove that mn = nm. 


Show that the number of partitions of the integer 7 into three parts equals 
the number of partitions of 2n into three parts of size <n. 


Show that the number of partitions of 7 is equal to the number of partitions 
of 2n into n parts. 


16. 


17. 


18. 


19, 


20. 


21. 


22. 
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Show that any number of partitions of 2r + k into r + k parts is the same 
for any k. 


Show that the number of partitions of r + k into k parts is equal to: 
fe +1). ; 
(a) The number of partitions of r + (* > ' into k distinct parts. 


(b) The number of partitions of r into parts of size < k. 


Show that the number of ordered partitions of n is 2”~!. For example, the 
ordered partitions of 4 are: 4, 1 + 3,2 + 2.3+1,1+1+2,1+2+4 
1,2+ 1+ 1,1+ 1+ 1+ 1. (Aint: Write n 1s in a row and determine 
all the ways to partition this sequence into clusters of 1s. 


(a) Find a generating function for a,, the number of partitions that add up 
to at most n. 


(b) Find a generating function for a,, the number of partitions of n into 
three parts in which no part is larger than the sum of the other two. 


(c) Find a generating function for a, , the number of different (incongruent) 
triangles with integral sides and perimeter n. 


Show that 2(1 — x)"[(1 — x)? + (1 + x] is the generating function 
for the number of ways to toss r identical dice and obtain an even sum. 


(a) A partition of an integer r is self-conjugate if the Ferrers diagram of 
the partition is equal to its own transpose. Find a one-to-one correspon- 
dence between the self-conjugate partitions of r and the partitions of 
r into distinct odd parts. 


(b 


— 


The largest square of dots in the upper left-hand corner of a Ferrers 
diagram is called the Durfee square of the Ferrers diagram. Find a 
generating function for the number of self-conjugate partitions of r 
whose Durfee square is size k (ak X k array of dots). (Hint: Use a 
1—1 correspondence between these and the partitions of r — k? into 
even parts of size at most 2k.) 


(c) Show that 
Ch CLs eC eee, 
=1+ 


2 
x* 


» (1 — x7) — x4 — x8)... — 5) 


Let {rt denote the number of partitions of n objects into k nonempty 


n+ 1 n n 
subsets. Show iat { k } =k {nr} + ie 
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23. Write a computer program to determine the number of all partitions of an 
integer r: 
(a) Into & parts. 
(b) Into any number of parts. 
(c) Into distinct parts. 


— 


6.4 EXPONENTIAL GENERATING FUNCTIONS 


In this section we discuss exponential generating functions. They are used to 
model and solve problems involving arrangements and distributions of distinct 
objects. The generating functions used in the previous sections to model selec- 
tion with repetition problems are called ordinary generating functions. Expo- 
nential generating functions involve a more complicated modeling process than 
do ordinary generating functions. Consider the problem of finding the number 
of different words (arrangements) of four letters when the letters are chosen 
from an unlimited supply of as, bs and cs, and the word must contain at least 
two as. The possible sets of four letters to form the word are {a, a, a, a}, {a, 
a, a, b}, {a, a, a, c}, {a, a, b, b}, {a, a, b, c}, and {a, a, c, c}. The number 
of arrangements possible with each of these six sets is 


4! 4! 4! 4! 4! 4! 
4!0!0! 31110! 31011! 2!2!0! 2'111! 21012! 


respectively. So the total number of words will be the sum of these six terms. 

The sets of letters used in such a word range over all sets of four letters 
chosen with repetition from as, bs and cs with at least two as. Equivalently, 
the number of such sets equals the number of integer solutions to the equation 


ej tea+e=4 2=6é, 0 = &, e3 (1) 


At first sight, the four-letter words problem seems similar to previous problems 
that could be modeled by (ordinary) generating functions. However, in this 
case we do not want each integer solution of Eq. (1) to contribute 1 to the 
count of the number of possible words. Instead it must contribute 4!/(e,!e2!e3!) 
words. In terms of generating functions, the coefficient of x* will count all 
formal products with associated coefficients 


+ e, + 3)! 
(e€) + €2 es)! 


212% 2=e, 0 S e€, e3 
e, leases! 


whose exponents sum to 4 (a much harder problem). Fortunately, exponential 
generating functions yield formal products of exactly this form. 
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An exponential generating function g(x) for a,, the number of arrange- 

ments with r objects, is a function with the power series expansion 

3 r 
gi) =a tax tae + ak + “5S ta tae 

We build exponential generating functions in the same way that we build 
ordinary generating functions: one polynomial factor for each type of object; 
each factor has a collection of powers of x that are an inventory of the choices 
for the number of objects of that type. However, now each power x’ is divided 
by r!. 

As an example, let us consider the four-letter word problem with at least 
two as. We claim that the exponential generating function for the number of 
r-letter words formed from an unlimited number of as, bs, and cs containing 
at least two as is 


x x3 x4 x? x3 x4 5 

soa (Grptge (reeks S as) (2) 
The coefficient of x’ in Eq. (2) will be the sum of all products 
(x*/e!)(x2/e,!)(x%/es!), where e, +e, +e; =7,2S €},0 S e,, e3. If we divide 
x’ by r! and compensate by multiplying its coefficient by r!, then the x” term 
in g(x) becomes 


r 
( > ano)s 2=e, 0S ee; 
ey te,tes=r e€,!en!e3! r! 
This coefficient of x’/r! is just what we wanted. So the exponential generating 
function for the number of such r-letter words is indeed the expression (2). 
What makes exponential generating functions work is the “‘trick’’ of dividing 
x" by r! and multiplying the coefficient of x’ by r!. 

Before we show how to calculate coefficients of an exponential generating 
function, let us give a few other examples of exponential generating function 
models. 


Example 1 


Find the exponential generating function for a,, the number of r arrangements 
(without repetition) of n objects. 

We know that the answer is P(n, r). Since there is no repetition, the 
exponential generating function is (1 + x)'—our old binomial friend! The 


: : : : Xe 
coefficient of x’ in (1 + x)" is &) but now we want the coefficient of ae 
r! 


n\ , n! : ni x’ 
Mo so 
r (n—r)ir! (n—r)!r! 


Thus a, = n!/(n — r)! = P(n, r), as expected. = 
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Example 2 


Find the exponential generating function for a,, the number of different arrange- 
ments of r objects chosen from four different types of objects with each type 
of object appearing at least two and no more than five times. 

The number of ways to arrange the r objects with e; objects of the ith type 
is r!/e,!e,tes!e,! To sum up all such terms with the given constraints, we want 
the coefficient of x’/r! in (x2/2! + x°/3! + x*/4! + x7/5!)*. 


Example 3 


Find the exponential generating function for the number of ways to place r 
(distinct) people into three different rooms with at least one person in each 
room. Repeat with an even number of people in each room. 

The required exponential generating functions are 


x x3 x4 3 x2 x! 3 
(+2 tete--] and (1+5+54---) . 


There are few identities or expansions to use in conjunction with exponential 
generating functions. As a result, there are only a limited number of exponential 
generating functions whose coefficients can be easily evaluated. The fundamen- 
tal expansion for exponential generating functions is 


fake aes 2 
e Sat a oe (3) 
Replacing x by nx in Eq. (3), we obtain 
Les nx? wx? n'x' 
é Re hag eee (4) 


The power series for e* is the Taylor series for e*, which is derived in all 
calculus texts. This expansion and the companion expansion for e™ are valid / 
for all values of x. There is no way to factor out a common power of x in 
exponential generating functions, since the power of x must be matched with 
r! in the denominator. The best that can be done is to subtract the missing 
(lowest) powers of x from e*. For example, the exponential factor representing 
two or more of a certain type of objects can be written 


Da Sie Gee” 3 


Two useful expansions derived from Eg. (3) are 


1 x? xt x6 
Yyertery=1+=— pie a eee : 
Her teN=lta tat ey ¥ (5) 
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|e ee Ee ae cee a 
xe —e ae ee a aes (6) 


Let us work some examples using Egs. (3) to (6). 


Example 4 


Find the number of different r arrangements of objects chosen from unlimited 
supplies of n types of objects. 

In Chapter 5 we would have solved this problem by arguing that there are 
n choices for the type of object in each of the r positions, for a total of n’ 
different arrangements. Now let us solve this problem with exponential generat- 
ing functions. The exponential generating function for this problem is 


2 3 n 
xe x 
(VereEsEe...) = (e*)" = e™ 


2! 3! 


By Eq. (4), the coefficient of x’/r! in this generating function is n’. = 


Example 5 


Find the number of ways to place 25 people into three rooms with at least one 
person in each room. 
In Example 3, we found the exponential generating function for this problem 


2 3 3 
(r+5424-..) =(e— 1)3 


To find the coefficient of x’/r! in (e* — 1)3, we first must expand this binomial 
expression in e* 
(e* — 1)? = e* — 3e* + 3e"- 1 


From Eq. (4), we get 
e* — 3e* + 3e7- 1 = 39-3 SIVA E-1 
r=0 r! r=0 r} r=0 r! 
So the coefficient of x¥/25! is 3° — (3 X 2%) + 3. = 


Example 6 


Find the number of r-digit quaternary sequences (whose digits are 0, 1, 2, and 
3) with an even number of Os and an odd number of 1s. 


270 Chapter 6 Generating Functions 


The exponential generating function for this problem is 


2 4 6 3 7 2 3 2 
(hehehe Verte Sede reed e-) 


2! 4! #6! 2! 3! 


Using identities (5) and (6), we can write this expression as 


xe" +e) xX xe" -erverer= re &t+&—e*ee 


ee ae ee ye een Oe ee ee eae 
te e*Ye* = Fe 1) ee a 1 


Then for r > 0, the coefficient of x’/r! is 34” = 4’"'. The simple form of this 
answer suggests that there should be some combinatorial argument for obtaining 
this short answer directly. = 


6.4 EXERCISES 


M Summary of Exercises Most of the first 15 exercises are simi- 


lar to the examples in this section. Exercises 21 and 22 are a continuation of 
the probability generating function exercises in Section 6.2. 


1. 


2. 


Find the exponential generating function for the number of arrangements 
of r objects chosen from five different types with at most five of each type. 


Find the exponential generating function for the number of ways to distrib- 
ute r people into six different rooms with between two and four in each 
room. 


. Find an exponential generating function for a,, the number of r-letter words 


with no vowel used more than once (consonants can be repeated). 


. (a) Find the exponential generating function for s,,, the number of ways 


to distribute r distinct objects into n distinct boxes with no empty box. 
Consider n a fixed constant. 


(b) Determine s,,. The number s,,/n! is called a Stirling number of the 
second kind. 


Find the exponential generating function, and identify the appropriate coef- 
ficient, for the number of ways to deal a sequence of 13 cards (from a 
standard 52-card deck) if the suits are ignored and only the values of the 
cards are noted. 


. How many ways are there to distribute eight different toys among four 


children if the first child gets at least two toys? 


7. 


8. 


9 
< 


10. 


11. 


12. 


13. 


14, 


15. 


16. 


17. 
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How many r-digit ternary sequences are there with: 

(a) An even number of 0s? 

(b) An even number of Os and even number of 1s? 

How many ways are there to make an r-atrangement of pennies, nickels, 


dimes, and quarters with at least one penny and an odd number of quarters? 
(Coins of the same denomination are identical.) 


How many 10-letter words are there in which each of the letters e, n, r, 5 
occur: 


(a) At most once? (b) At least once? 


How many r-digit ternary sequences are there in which: 

(a) No digit occurs exactly twice? 

(b) 0 and 1 each appear a positive even number of times? 

How many r-digit quaternary sequences are there in which the total number 
of Os and 1s is even? 


Find the exponential generating function for the number of ways to distrib- 
ute r distinct objects into five different boxes when b, < b, = 4, where 
b,, by are the numbers of objects in boxes 1 and 2, respectively. 


(a) Find the exponential generating functions for D,, the probability that 
the first two boxes each have at least one object when r distinct objects 
are randomly distributed into n distinct boxes. 


(b) Determine p,. 


Find an exponential generating function for the number of distributions of 
r distinct objects into n different boxes with exactly m nonempty boxes. 


Find an exponential generating function with: 
(a) a, = 1f(r + 1) (b) a, =r! 


Show that if g(x) is the exponential generating function for a,, then g(0) 
= a, where g(x) is the kth derivative of g(x). 
If 


— ay 
{a= » a g(x) = > b= 
and 


Oo x’ 
hex) =f@)8@) = Ye 5 
then show that 
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18. Show that e’e? = e**” by formally multiplying the expansions of e* and 
e” together. 


19. Find a combinatorial argument to show why the answer in Example 6 
is 47. 

20. Show that e*/(1 — x)" is the exponential generating function for the number 
of ways to choose some subset (possibly empty) of r distinct objects and 
distribute them into n different boxes with the order in each box counted. 


21. A Poisson random variable X has p, = Prob(x = r) = f e“, Find the 


r! 
probability generating function for X (see Exercise 34 in Section 6.2). 
22. Let P(x) = dt-0 p,x* be the probability generating function for the discrete 
random variable X, that is, p, is the probability that X=k. 
(a) Show that the exponential generating function for m,, the kth moment 
of X, m, = di=o jp, is P(e*). 
(b) The kth factorial moment of X, m#, is defined to be equal to Dirk 
I! 
(j—%! 
P(x + 1). 
(c) If X is the number of heads when n coins are flipped, find m, m2, 
mx. (Hint: Use Exercise 16.) 


(d) If X is Poisson (see Exercise 21), find m, and m*. 


—e"rreor_—f 


Show that the exponential generating function for mk is 


6.5 A SUMMATION METHOD 


In this section we show how to construct an (ordinary) generating function 
h(x) whose coefficient of x’ is some specified function p(r) of r, such as r? or 
C(r, 3). Then we use h(x) to calculate the sums p(0) + p(1) + °° + p(n), 
for each positive n. The following four simple rules for constructing a new 
generating function from an old one will be used repeatedly in this section. 
Assume that A(x) = Sa,X,, B(x) = Lbpxn, and C(x) = DYenXn- 

(i) If b, = da,, then B(x) = dA(x), for any constant d. 

(ii) If cy = ad, + bn, then C(x) = AQ) + B(x). 

(iii) If c, = Xho a:b,-1, then C(x) = A(x)B(x). 

(iv) If b, = nz, except b; = 0 for i < k, then B(x) = x‘A(x). 


ll 


Rule (iii) is simply expression (6) from Section 6.2. The other rules are imme- 
diate. 
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The other basic operation for our coefficient construction is to multiply 
each coefficient a, in a generating function g(x) by r. The new generating 
function g*(x) with a* = ra, is obtained by differentiating g(x) and then 


multiplying by x, that is, g*(x) = (4 #00). If 


8(X) = ay + ax t+ x? + ayxrite++tax't+--- (1) 


then differentiation of g(x) yields 
S ela) =a, + 2ax t+ 3agx?> t+ +rax +--+: (2) 
and now multiplying by x [Rule (iv)] gives 
g*(x) = | 2209] = ax + 2anx? + 3a3x3 + +++ 4+ ra,x' +++ (3) 


Note that the a term in g(x) disappears in Eq. (2) because Oa, = 0. 

Combining this operation with Rules (i) and (ii), we can repeatedly multiply 
the coefficient of x” by r or a constant and can add such coefficients together. 
This permits us to form polynomials in r. 

The natural question now is, To what coefficients do we apply these opera- 
tions? The natural answer is, When in doubt start with the unit coefficients 
a, = 1 of the generating function 

1 


ToS Lt xt xP tae ter tur te. 
1—-x 


Example 1 


Build a generating function h(x) with a, = 2r?. 
Starting with 1/(1 — x), we multiply coefficients by r using the generating 
function operations shown in Eqs. (2) and (3) to obtain 


d | 1 7 
— = —__—_ } = + oe aera Ppeee 
(4; 1) (5 +.) 1x + 2x? + 3x rx 


Now we repeat these operations on x/(1 — x)? to obtain 


d x x(1 + x) 
et ia ees = [244+ 2x24 253 eee Ayre... 
x( <5) Ga x + 2°x 3°x? + + rx 


Finally we multiply by 2 to obtain the required generating function 


OY = 2 x xt (2% BaP + (2K FX 
(1 —~x) 


beset (2X rx tes: 


h(x) = 
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Example 2 
Build a generating function h(x) with a, = (r + Ir(r — 1). 

We could multiply (r + 1)r(r — 1) out getting a, = r3 — r, obtain generating 
functions for r? and ras an Example 1, and then subtract one generating function 
from the other. It is easier, however, to start with 3!(1 — x) *, whose coefficient 
a, equals 


anal (TPA) a CEB EE = 4 Kr + HEED 
r r!3! r! 


Then the power series expansion of 3!(1 — x)“ is 


! 
Gopi BX 2” 1+ (4X3 X 2)x + (5X 4X 3)x? 

tere t(r t+ 3rt2r+ 1x +>: (4) 
Compare Eq. (4) with the desired generating function 


h(x) = (3X 2X Vx? + (4X 3X xi + 5 X4X 3)x4 
tee-t(rt)Ir(r— Ux’ t+-°: 


Clearly A(x) is just the series in Eq. (4) multiplied by x°. So A(x) = 31x71 — 
x) 4. # 


Generalizing the construction in Example 2, we see that (n — 1)!(1 — x) 
has a coefficient 


a,=(n-DICr+tn-1n= (r+ @— Dirt @— De e+ DY 


Any coefficient involving a product of decreasing terms, such as (r + l)r 
(r — 1), can be built from (” — 1)!(1 — x)" as in Example 2, where n — 1 
is the number of terms in the product. 

Thus far the construction of an h(x) with specified coefficients has been 
just an exercise in algebraic manipulations of generating functions. The follow- 
ing easily verified theorem gives some purpose to this exercise. 


Theorem 1 
If h(x) is a generating function with a, the coefficient of x’, then h*(x) = 
h(x)/(1 — x) is a generating function of the sums of the a,s. That is, 


h*(x) = ay + (ao + ai)x + (do + a + a2)x? +0 + ay aie Ane 
This theorem follows from Rule (iii) for the coefficients of the product 


h*(x) = fh), where f(x) = 1/ — x). Now we return to the previous 
examples. 
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Example 1 (continued) 


Evaluate the sum 2 X 17+2* 27+2x 3 4--++ + 2y2 

The generating function h(x) for a, = 2r? was found in Example 1 to be 
2x(1 + x)/(1 — x). Then by Theorem 1, the desired sum QA, t+ apt +354 
a, is the coefficient of x” in 


h*(x) = h(x)/ — x) = 2x(1 + )/C1 — x)4 
= 2x(1' x)" +271 —xy* 
The coefficient of x” in 2x(1 — x)~‘ is the coefficient of x”! in 2(1 — x)4, 


and the coefficient of x" in 2x7(1 — x)~‘ is the coefficient of x"~? in 21 — x)". 
Thus, the given sum equals 


(n—1)+4-1 (n-—2)+4-1 - n+2 n+1 
a (2-1) ) +2 (n-2) )=2{ 3 ) +a 3 : 


Example 2 (continued) 


Evaluate the sum 3 X2X1+4x3x2+ ot (n+ Intn — 1), 

The generating function h(x) for a, = (r + 1)r(r — 1) was found in 
Example 2 to be h(x) = 6x*(1 — x)". By Theorem 1 the desired sum is the 
coefficient of x” in h*(x) = h(x)/(1 — x) = 6x(1 — x)~>. This coefficient is 
the x"? term in 6(1 — x)~>, namely, 6C((n — 2) + 5 — 1,n — 2) = 6C(n + 
2,4). = 


Note that these two summation problems can also be solved with the 
summation method in Section 5.5 based on binomial identities. Both methods 
have their advantages. 


6.5 EXERCISES 
1. Find ordinary generating functions whose coefficient a, equals: 


(a) r (b) 13 (©) 3r? dd) 3r +7) rr — lyr - 2yr — 3). 
2. Evaluate the sums (using generating functions): 

(a)O+1+2+4+---4n 

(b) 13 + 13 4+---4 13 

(c)0+3+124+--+ +37 

(d)7+10+134+---+@Gn+7) 


@)4X3X2xX1+5X4X3X2+ + t+ann—-Da-DaA- 
3), ; 
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3. Finda generating function witha, = r(r + 2) (do not add together generating 
functions for r? and for 27). 


4. (a) Show how 7? and r? can be written as linear combinations of P(7,3), 
P(r,2), and P(r,1). 

(b) Use part (a) to find a generating function for 3r3 — 5° + 4r. 

5. Find a generating function for 
(a) a, =(r- ly (b) a, = 1/r. 

6. If h(x) is the ordinary generating function for a,, what is the coefficient of 
x' in h(x)\(1 — x) (give your answer in terms of the a,s). 

7. Verify Theorem 1 in this section. 


8. If h(x) is the ordinary generating function for a,, find the generating function 
for s, = Sf,+1 a, assuming all s,s are finite and a, ~ Oasr— ©. 


‘}y° 6.6 SUMMARY AND REFERENCES 


Generating functions are at once a simpleminded and a sophisticated mathemati- 
cal model for counting problems; simpleminded because polynomial multiplica- 
tion is a familiar, seemingly well-understood part of high school algebra and 
sophisticated because with standard algebraic manipulations on generating func- 
tions, one can solve complicated counting problems. These algebraic manipula- 
tions automatically perform the correct combinatorial reasoning for us! Note 
that generating functions are an elementary example of the algebraic approach 
that pervades the research frontiers of contemporary mathematics. Letting alge- 
braic expressions, whether they model combinatorial, geometric, or functional 
information, do the work for us is what much of modern mathematics is all 
about. 

In this chapter, generating functions were used to model selection and 
arrangement problems with constrained repetition. Partition problems were also 
modeled (but were not solved). Finally, we showed how to construct generating 
functions whose coefficient for x” was a given function of r and used these 
generating functions to evaluate related sums. In the next three chapters, generat- 
ing functions will be used to model and solve other combinatorial problems. 
Exercise 34 of Section 6.2 introduces one type of generating function used in 
probability theory (for more, see Feller [2]). Laplace and Fourier transforms 
in analysis are also generating functions [the Fourier transform of a function 
f (0) can be viewed as a generating function for the Fourier coefficients of f(A]. 

The first use of combinatorial generating functions was by DeMoivre around 
1720. He used them to derive a formula for Fibonacci numbers (this derivation 
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is given in Section 7.5). In 1748, Euler used generating functions in his work 
on partition problems. The theory of combinatorial generating functions, devel- 
oped in the late eighteenth century, was primarily motivated by parallel work 
on probability generating functions (see Exercises 34 to 38 in Section 6.2 and 
Exercises 21 and 22 in Section 6.4). Laplace made many contributions to both 
theories and presented the first complete treatment of both in his 1812 classic 
Theorie Analytique des Probabilities. 


For a full discussion of the use of generating functions in combinatorial 


mathematics, see MacMahon [3] or Riordan [4]. For a nice presentation of 
partition problems, also see Berge [1]. 


1. 
2. 


C. Berge, Principles of Combinatorics, Academic Press, New York, 1971. 
W. Feller, An Introduction to Probability Theory and Its Applications, vol. 
I, 2nd ed., John Wiley & Sons, New York, 1957. 

P. MacMahon, Combinatory Analysis, vols. I and II (1915), reprinted in 
one volume, Chelsea Publishing, New York, 1960. ‘ 

J. Riordan, An Introduction to Combinatorial Analysis, John Wiley & Sons, 
New York, 1958. 


CHAPTER 7 


RECURRENCE 
RELATIONS 


7.1 RECURRENCE RELATION MODELS 


In this chapter we show how a variety of counting problems can be modeled 
with recurrence relations. We then discuss methods of solving several common 
types of recurrence relations. 

A recurrence relation is a recursive formula that counts the number of 
ways to do a procedure involving n objects in terms of the number of ways to 
do it with fewer objects. That is, if a, is the number of ways to do the procedure 
with k objects, for k = 0, 1, 2,..., then a recurrence relation is an equation 
that expresses a, as some function of preceding a,s, k < n. The simplest 
recurrence relation is an equation such as a, = 2a,-,. The following equations 
display some of the forms of recurrence relations that we will derive to model 
counting problems in the chapter: 


An = CiAn—1 + C2Qn-2 + °° + + CAn-, where ¢; are constants 
Gy = Can) + f(n) where f(7) is some function of n 
An = AyAn-| F AAy-2 + * + + + Ay-1Ay 


Anm = Qn-1,m + Qn-1,m-1 


Just as mathematical induction is a proof technique that verifies a formula 
or assertion by inductively checking its validity for increasing values of n, so 
a recurrence relation is a counting technique that solves an enumeration problem 
by recursively computing the answer for successively larger values of n. 

The observant reader will remember that mathematical induction also 
involves an initial step of verifying the formula or assertion for some starting 
(smallest) value of n. The same is true for a recurrence relation. We cannot 
recursively compute the next a, unless some initial values are given. If the 
right-hand side of a recurrence relation involves the r preceding a,s, then we 
need to be given the first r values, aj, a), .. . , d,-1. For example, in the relation 
Ay = An-| + @,-2, knowing only that a, = 2 is insufficient. If given also that 
a, = 3, then we use the relation to obtain a, = a, + qa =3+2=5;a,= 
a+ a,=5 + 3 = 8; a, = 8 + 5 = 13; and so on. The information about 
starting values needed to compute with a recurrence relation is called the initial 


conditions. 
279 | 


280 Chapter 7 Recurrence Relations 


If we can devise a recurrence relation to model a counting problem we are 
studying and also determine the initial conditions, then it is usually possible 
to solve the problem for moderate sizes of n, such as n = 20, in a few minutes 
by recursively computing successive values of a, up to the desired n. For larger 
values of n, a programmable calculator or computer is needed. 

For many common types of recurrence relations, there are explicit solutions 
(formulas) for a,. Sections 7.3—7.5 discuss some of these solutions. However, 
it is frequently easier to determine recursively the value of a, a,,... up to 
the desired value, say, a). than to compute a, directly from a formula for a,. 


Example 1 


Find a recurrence relation for the number of ways to arrange n distinct objects 
in a row. Find the number of arrangements of eight objects. 

Let a, denote the number of arrangements of n objects. There are n choices 
for the first object in the row. This choice can be followed by any arrangement 
of the remaining n — 1 objects; that is, by the a,_, arrangements of the remaining 
n — 1 objects. Thus a, = na,-,. Substituting recursively in this relation we see 
that 


a, = na,-; = n(n — laa] = ++ = n(n — Vn 20+ kK 2X 1L=a! 


In particular, ag = 8! = 


Of course, we already know that the number of arrangements of n objects 
is n! It is often useful to try out a new technique on an old problem to see 
how it works before using it on new problems. Now for some new problems. 


Example 2 


An elf has a staircase of n stairs to climb. Each step it takes can cover either 
one stair or two stairs. Find a recurrence relation for a,, the number of different 
ways for the elf to ascend the m-stair staircase. 

It is easy to see that a, = 1, a, = 2 (two one-steps or one two-steps) and 
a; = 3. For n = 4, Figure 7.1a depicts one way of climbing the stairs, taking 
successive steps of sizes 1,2, 1. Other possibilities are a two-step either preceded 
or followed by two one-steps, or two two-steps, or four one-steps. In all, we 
count five ways to climb the four stairs; so a, = 5. 

Is there some systematic way to enumerate the ways to climb four stairs 
that breaks the problem into parts involving the ways to climb three or fewer 
stairs? Clearly, once the first step is taken there are three or fewer stairs 
remaining to climb. Thus we see that after a first step of one stair, there are 
a, ways to continue the climb up the remaining three stairs. If the first step 
covers two stairs, then there are a, ways to continue up the remaining two 
stairs. So a, = a; + a. We confirm that the values for a,, a3, a, satisfy this 
relation: 5 = 3 + 2. This argument applies to the first step when climbing any 
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ay. 2 Ways 


Figure 7.1 (a) (b) (c) 


number of stairs, as is shown in Figures 7.1b and 7.1c. Thus a, = a,-,; + 
a,-2- © 


In Section 7.3 we obtain an explicit solution to this recurrence relation. 
The relation a, = a,-, + a,-2 is called the Fibonacci relation. The numbers 
a, generated by the Fibonacci relation with the initial conditions a) = a; = 1 
are called the Fibonacci numbers. They begin 1, 1, 2, 3, 5, 8, 13, 21, 34, 
55, 89. Fibonacci numbers arise naturally in many areas of combinatorial 
mathematics. There is even a journal, Fibonacci Quarterly, devoted solely to 
research involving the Fibonacci relation and Fibonacci numbers. Fibonacci 
numbers have been applied to other fields of mathematics, such as numerical 
analysis. They occur in the natural world—for example, the arrangements of 
petals in some flowers. For more information about the occurrences of Fibonacci 
numbers in nature see [1]. 


Example 3 


Suppose we draw n straight lines on a piece of paper so that every pair of lines 
intersect (but no three lines intersect at a common point). Into how many 
regions do these n lines divide the plane? 

Again we approach the problem by examining the situation for small values 
of n. With one line, the paper is divided into two regions. With two lines, we 
get four regions, that is, a, = 4. See Figure 7.2a. From Figure 7.2b, we see 
that a; = 7. The skeptical reader may ask, how do we know that three intersecting 
lines will always create seven regions? Let us go back one step then. 

Clearly two intersecting lines will always yield four regions, as shown in 
Figure 7.2a. Now let us examine the effect of drawing the third line (labeled 
“‘3”’ in Figure 7.2b). It must cross each of the other two lines (at different 
points). Before, between, and after these two intersection points, the third line 


ied 


Figure 7.2 (a) 
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cuts through three of the regions formed by the first two lines (this action of 
the third line does not depend on how it is drawn, just that it intersects the 
other two lines). So in severing three regions, the third line must form three 
new regions, actually creating six new regions out of three old regions. Thus 
a; = a, + 3 = 4 + 3 = 7, independently of how the third line is drawn. 

Similarly, the fourth line severs four regions before, between, and after its 
three intersection points with the first 3 lines (see Figure 7.2c), so that a, = 
a, + 4= 7+ 4 = 11. In general, the nth line must sever n regions before, 
between, and after its n — 1 intersection points with the first n — 1 lines. So 
a, = @,-; +n. = 


Example 4 


The Tower of Hanoi is a game consisting of n circular rings of varying size 
and three pegs on which the rings fit. Initially the rings are placed on the first 
(leftmost) peg with the largest ring at the bottom covered by successively 
smaller rings. See Figure 7.3a. By transferring the rings among the pegs, one 
seeks to achieve a similarly tapered pile on the third (rightmost) peg. The 
complication is that each time a ring is transferred to a new peg, the transferred 
ring must be smaller than any of the rings already piled on this new peg; 
equivalently, at every stage in the game there must be a tapered pile (or no 
pile) on each peg. 

Find a recurrence relation for the minimum number of moves required to 
play the Tower of Hanoi. How many moves are needed to play the six-ring 
game? Try playing this game with a dime, penny, nickel, and quarter (four 
rings) before reading our solution. 

The key observation is that if, say, the six smallest rings are on peg A and 
we want to move them to peg C, we must first ‘‘play the five-ring Tower-of- 
Hanoi game’’ to get the five smallest rings from peg A to peg B, then move 
the sixth smallest ring from peg A to peg C, and then again ‘“‘play the five- 
ring game’’ from peg B to peg C. See Figure 7.3 (Of course, to move the five 
smallest rings we must move the fifth smallest from A to B, which means 
playing four-ring games from A to C and from C to B, etc.) Thus to move the 
n rings from A to C, the n — 1 smallest rings must first be moved from A to 


A A 
B 


(a) (5) 
Figure 7.3 Tower of Hanoi 
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B, then the largest (nth) ring moved from A to C and then the n — 1 smallest 
rings moved from B to C. 

If a, is the number of moves needed to transfer a tapered pile of n rings 
from one peg to another peg, then a, = a,-; + 1 + a,-; = 2a,-; + 1. The 
initial condition is a, = 1, and so a, = 2a, + 1 = 3; a; = 2a, + 1 = 7; 
a, = 2a; + 1 = 15; as = 2a, + 1 = 31; and a; = 2a; + 1 = 63. So the 
six-ring game requires 63 moves. Note that the a,’s thus far fit the formula 
a,= 2"-— 1. = 


Example 5 


A bank pays 8 percent interest each year on money in savings accounts. Find 
recurrence relations for the amounts of money a gnome would have after n 
years if it follows the investment strategy of: 


(a) Investing $1000 and leaving it in the bank for n years. 
(b) Investing $100 at the end of each year. 


If an account has x dollars at the start of a year, then at the end of the year 
(i.e., start of the next year) it will have x dollars plus the interest on the x 
dollars, provided no money was added or removed during the year. Then for 
part (a), the recurrence relation is a, = a,-; + .08a,-; = 1.08a,_,. The initial 
condition is @) = 1000. For part (b), the relation must reflect the $100 added 
(which earns no interest since it comes at the end of the year). So a,= 1.08a,_; 
+ 100, with a) = 0. = 


Example 6 


Find a recurrence relation for the number of different ways to hand out a piece 
of chewing gum (worth 1¢) or a candy bar (worth 10¢) or a donut (worth 20¢) 
on successive days until n¢ worth of food has been given away. 

This problem can be treated similarly to the stair-climbing problem in 
Example 2. That is, if on the first day we hand out 1¢ worth of chewing gum, 
we are left with (n — 1)¢ worth of food to give away on following days; if 
the first day we hand out 10¢ worth of candy, we have (n — 10)¢ worth to~ 
dispense the next days; and if 20¢ then (n — 20)¢ the next days. So a, = a,-; 
+ G,-19 + Gy-29 With a) = 1 (one way to give nothing—by giving 0 pieces of 
each item), and implicitly, a, = 0 fork <0. m= 


Example 7 


Find a recurrence relation for a,, the number of n-digit ternary sequences 
without the subsequence ‘‘012.”’ 

Recall that a ternary sequence is a sequence composed of Os, Is and 2s. 
We start with the sort of analysis used in the elf stair-climbing problem. If the 
first digit in an n-digit ternary sequence is 1, then there are a,_, (n — 1)-digit 
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ternary sequences without the pattern 012 that can follow that initial 1. Similarly 
if the first digit is 2. 

However, there is a problem if the first digit is a 0. Among the (n — 1)- 
digit ternary sequences without the pattern 012 that might follow the initial 0 
are sequences that start with 12. While such (nm — 1)-digit sequences do not 
contain the pattern 012, the n-digit sequences of 0 followed by such (n — 1)- 
digit sequences do start with the pattern 012. We correct this mistake by 
subtracting off all n-digit sequences starting with 012 (but with 012 not 
appearing later in the sequence). Such sequences are formed by 012 followed 
by any (n — 3)-digit ternary sequence with no 012 pattern—there are a,-; such 
sequences. Thus the desired recurrence relation is: a, = @,-; + Qa—1 + (@i-1 — 
Qy-3) = 3@y-1 — An-3. 


The preceding examples developed recurrence relation models either by 
breaking a problem into a first step followed by the same problem for a smaller 
set (Examples 1, 2, 6 and 7) or by observing the change in going from the 
case of n — | to the case of n (Examples 3 and 5). Example 4 was a modified 
form of the former procedure in which the move of the nth ring (largest ring) 
was seen to be preceded and followed by an (n — 1)-ring game. 

There are two simple methods for solving some of the relations seen thus 
far. The first is recursive backward substitution: wherever a,-, occurs in the 
relation for a,, we replace a,_, by the relation’s formula for a,_, (involving 
a,-2) and then replace a,-,, and so on. In Example 3, the relation a, = a,-) + 
n becomes a, = (4-2 tn-—1A)tn=ss+ = 1L+24+34+-+-4+n-1 t+ 
n, We used this method in Example 1 to obtain a, = n(n — 1)(n — 2) X +--+ 
xX2xX1. 

The second method is to guess the solution to the relation and then to 
verify it by mathematical induction. In Example 4, we noted that a, = 2” — 1 
for the first six values of the recurrence relation a, = 2a,-, + 1, a; = 1. We 
prove a, = 2" — 1 by induction as follows. It was true for n = 1. Assuming 
this solution is true for a,_,, we then have a, = 2a,-, + 1 = 2(27'-— 1)+ 1 
= 2h 2a = 2 od, 


Example 8 


Use the following recurrence relation to determine the chromatic polynomial 
P.(G) of the graph G in Figure 7.4a. Recall that P,(G) gives the number of 
different k-colorings of G (see Example 7 in Section 5.2). For any two nonadja- 
cent vertices x, y in G, Gj is obtained from G by adding the edge (x, y), and 
G‘, is obtained from G by coalescing vertices x and y into a single vertex that 
is adjacent to all vertices formerly adjacent to x or y. Gj, corresponds to colorings 
of G in which x and y receive different colors, and G‘, corresponds to colorings 
of G in which x and y have the same color. Then 


P(G) = P,(Gy) + Pi(Gy) 


7.1 Recurrence Relation Models 285 


Figure 7.4 (a) (b) (c) 


To apply this recurrence relation to the graph in Figure 7.4a, we need some 
‘‘initial condition’’ graphs whose chromatic polynomial is known. The required 
building blocks are the collection of complete graphs on c vertices K, (all 
vertices mutually adjacent). In Example 7 of Section 5.2 we showed that P;(K.) 
= P(k,c) = kt/(k — c)! 

Consider the nonadjacent pair b, d in G. Then P,(G) = P,(Gju) + 
P,(G‘,). Figure 7.4b shows Gj, and Figure 7.4c shows Gj,. To determine 
P,(Gjz) we use the nonadjacent pair c, e: Px(Gju) = Pi((Gia)ce) + Pi(Gra)te) = 
P(Ks) + PA(K,) = kt/(k — 5)! + kA — 4)! Similarly, P.(Gi.) = 
P.((Ghace) + Pi(Gha) = Pi((Gia)ie) = Pi(Ka) + Pr(K3) = kY/(k — 4)t + ktY/ 
(k — 3)! In total, P(G) = k1/(k — 5)! + 2[kKI/(k — 41] + k/(k — 3)! 


We now consider some more complex recurrence relations involving two 
variable relations and simultaneous relations. 


Example 9 


Let a,,, denote the number of ways to select a subset of k objects from a set 
of n distinct objects. Find a recurrence relation for a,,,. 


Observe that a,, is simply (") We break the problem into two subcases 


based on whether or not the first object is used. There are a,-;, k-subsets that 
do not use the first object, and there are a,-;,- k-subsets that do use the first 
object. So we have a4 = Qn-1,¢ + An-i4-1- This is the Pascal’s Triangle Identity 
[identity (3) from Section 5.5]. The initial conditions are @,9 = @,,, = 1 for all 
n=O(anda,,=0,k >n). = 


Example 10 


Find a recurrence relation for the ways to distribute n identical balls into k 
distinct boxes with between two and four balls in each box. Repeat the problem 
with balls of three colors. 

In the spirit of the previous example, we consider how many balls go into 
the first box. If there are two in the first box, then there are a,_.,-; ways to 
put the remaining n — 2 identical balls in the remaining & — 1 boxes. Continuing 
this line of reasoning, we see that a,, = Gp-24-1 + Gp—3e-1 + Gn—ayz-1. The initial 
conditions are a); = a3; = a4, = 1 and a, = 0 otherwise. 
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If now three colors are allowed, there are C(2 + 3 — 1, 2) = 6 ways to 
pick a subset of two balls for the first box from three types of colors with 
repetition. (Review the start of Section 5.3 for a discussion of selection with 
repetition.) Similarly, there are C(3 + 3 — 1,3) = 10 ways to pick three balls 
from 3 types and C(4 + 3 — 1, 4) = 15 ways to pick four balls from 3 types. 
Then @,4 = 64,-24-1 + 10d,-34-1 + 15a,-44-1 with initial conditions a,, = 6, 
a,, = 10, ay; = 15, and a,; = 0 otherwise. 

The problem with all balls identical can be solved by generating functions 
(see Example 3 of Section 6.1), but recurrence relations are the only practical 
approach with the extra constraint of different types of balls. = 


Example 11 
Find a system of recurrence relations for: 


(a) The number of n-digit binary sequences with an even number of Os, and 


(b) The number of n-digit ternary sequences with an even number of Os and 
an even number of Is. 


(a) We use the first-step analysis of the stair-climbing model to find a 
recurrence relation for a,, the number of n-digit binary sequences with an even 
number of Os. If an n-digit binary sequence starts with a 1, then we require an 
even number of Os in the remaining (n — 1)-digit sequence—a,_, such 
sequences. If the n-digit sequence starts with a 0, we require an odd number 
of Os in the remaining n — 1 digits—2""' — a,_, such sequences, since all 2"! 
(n — 1)-digit sequences minus the even-Os (n — 1)-digit sequences yields the 
odd-0s (n — 1)-digit sequences. In sum, a, = a,-) + (2! — a,-1) = 2" '. Our 
recurrence relation reduces into a formula for a,. In words, every (n — 1)-digit 
sequence yields an even-Os n-digit sequence by adding a 1 in front if the (n - 
1)-digit sequence has an even number of Os and adding a 0 in front if the 
sequence has an odd number of Os. 

(b) We will need simultaneous recurrence relations for a,, the number of 
n-digit ternary sequences with even Os and even 1s; b,, the number of n-digit 
ternary sequences with even Os and odd 1s; and c,, the number of n-digit 
sequences with odd Os and even 1s. Observe that 3" — a, — b, — c, is the 
number of n-digit ternary sequences with odd Os and odd 1s. An n-digit ternary 
sequence with even Os and even Is is obtained either by having a 1 for the 
first digit followed by an (nm — 1)-digit sequence with even Os and odd 1s, or 
a 0 followed by an (n — 1)-digit sequence with odd Os and even Is, or a 2 
followed by an (nm — 1)-digit sequence with even Os and even 1s. Thus a, = 
by-1 + Cp + @,-1. Similar analyses yield b, = a,-; + (37! — ay-1 — ba-1 — 
Cn-1) + Dy ac 3" — Cn-1 and C, = Qn-1 a GB"! — Ay-) ~ Da-1 TS Cn~1) + Cn-1 
= 3"! — b,_,. The initial conditions are a, = b, = c, = 1. To recursively 
compute values for a,, we must simultaneously compute b, and c,. = 
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We close this section with a few words about difference equations. The 
first (backward) difference Aa, of the sequence (ap, a,, a), .. .) is defined to 
be Aa, = a, — a@,-;. The second difference is A’a, = Aa, — Aa,-; = a, — 
2an-1 + Gyr, and so on. A difference equation is an equation involving a, and 
its differences, such as 2A’a, — 3Aa, + a, = 0. Observe that 


Qn-| = An — (a, = Qn-\) = a, — Aa, (*) 
Qn-2 = Gn-, ~ AQ,-| = (a, — Aa,) — A(a, ~ Aa,) = a, — 2Aa, + A’a, 
Similar equations can express a,-, in terms of a,, Aa,,..., A‘a,. Thus any 


recurrence relation can be rewritten as a difference equation, by expressing the 
a,-,S on the right-hand side of (*) in terms of a, and its differences. Conversely, 
by writing Aa, as a, — a,-;, and so on, any difference equation can be written 
as a recurrence relation. 

Difference equations are commonly used to approximate differential equa- 
tions when solving differential equations on a computer. Difference equations 
have wide use in their own right as models for dynamical systems for which 
differential equations (which require continuous functions) are inappropriate. 
They are used in economics in models for predicting the gross national product 
in successive years. They are used in ecology to model the numbers of various 
species in successive years. As noted above, any difference equation model 
can also be formulated as a recurrence relation; however the behavior of a 
dynamical system is easier to analyze and explain in terms of differences. Refer 
to Sandefur [4] for further information about difference equations and their 
applications. 


Example 12 


Assume that if undisturbed by foxes, the number of rabbits increases each year 
by an amount a,,, where r, is the number of rabbits, but when foxes are present, 
each rabbit has probability Bf, of being eaten by a fox (f, is the number of 
foxes). Foxes alone decrease by an amount yf, each year, but when rabbits are 
present, each fox has probability 57,, of feeding and raising up a new young 
fox (death of foxes is included in the yfn term). Give a pair of simultaneous 
difference equations describing the number of rabbits and foxes in successive 
years. 

The information given about yearly changes in the two populations yields 
the difference equations: 

Ar, = ar, — Brifn 


Afh=Y¥hit orf, m= 


7.1 EXERCISES 


Md Summary of Exercises The first 35 exercises call for recur- 


rence relation modeling similar to that in the examples, with multiple indices 
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and equations required in Exercises 26—35 and difference equations in Exercises 


36- 


1. 


10. 


38. The remaining exercises are more advanced problems. 


Find a recurrence relation for the number of ways to distribute n distinct 
objects into five boxes. What is the initial condition? 


. (a) Find a recurrence relation for the number of ways the elf in Example 


2 can climb a stairs if each step covers either one or two or three 
stairs? 


(b) How many ways are there for the elf to climb five stairs? 


. Find a recurrence relation for the number of ways to arrange cars in a row 


with n spaces if we can use Cadillacs or Continentals or Fords. A Cadillac 
or Continental requires two spaces, whereas a Ford requires just one space. 


. (a) Find a recurrence relation for the number of ways to go n miles by 


foot walking at 3 miles per hour or jogging at 6 miles per hour or 
running at 10 miles per hour; at the end of each hour a choice is made 
of how to go the next hour. 


(b) How many ways are there to go 14 miles? 
Find a recurrence relation for the number of ways to distribute a total of 


n cents on successive days using 1947 pennies, 1958 pennies, 1971 pennies, 
1951 nickels, 1967 nickels, 1959 dimes, and 1975 quarters. 


. (a) Find a recurrence relation for the number of n-digit binary sequences 


with no pair of consecutive 1s. 
(b) Repeat for n-digit ternary sequences. 


(c) Repeat for n-digit ternary sequences with no consecutive 1s or consecu- 
tive 2s. 


. Find a recurrence relation for the number of pairs of rabbits after n months 


if: (1) initially there is one pair of rabbits who were just born; and (2) 
every month each pair of rabbits that are over one month old have a pair 
of offspring (a male and a female). 


. Show that the binomial sum 


C3) 6) Ca 


satisfies the Fibonacci relation. 


. Find a recurrence relation for the number of ways to arrange n dominoes 


to fill a 2-by-n checkerboard. 


Find a recurrence relation for a, for the number of bees in the nth previous 
generation of a male bee, if a male bee is born asexually from a single 
female and a female bee has the normal male and female parents. The 
ancestral chart at the right shows that a, = 1, a, = 2, a; = 3. 


15. 


16. 
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. Find a recurrence relation for a, for the number of ways for an image to 


be reflected n times by internal faces of two adjacent panes of glass. The 
diagram below shows that a = 1, a, = 2, and a, = 3. 


0 reflections 1 reflection 2 reflections 


= Me a 


. Find a recurrence relation for the number of regions created by n mutually 


overlapping circles on a piece of paper (no three circles have a common 
intersection point). 


. (a) Find a recurrence relation for the number of regions created by n lines 


on a piece of paper if k of the lines are parallel and the other n — k 
lines intersect all other lines (no three lines intersect at one point). 


(b) If n = 9 and k = 3, find the number of regions. 


. Show that each of the following rules for playing the Tower of Hanoi 


works. 


(a) On odd-numbered moves, move the smallest ring clockwise one peg 
(think of the pegs being at the corners of a triangle); and on even- 
numbered moves, make the only legal move not using the smallest 
ring. 

(b) Number the rings from 1 to n in order of increasing size. Never move 
the same ring twice in a row. Always put even-numbered rings on top 
of odd-numbered rings (or on an empty peg) and put odd-numbered 
rings on top of even-numbered rings (or on an empty peg). 


Find a recurrence relation for the amount of money in a savings account 
after n years if the interest rate is 6 percent and $50 is added at the start 
of each year. 


(a) Find a recurrence relation for the amount of money outstanding on a 
$30,000 mortgage after n years if the interest rate is 8 percent and the 


yearly payment (paid at the end of each year after interest is computed) 
is $3000. 
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17. 


18. 


19. 


20. 


21. 


22. 


23 


24. 


25. 


26. 


27. 


28. 


29. 


(b) Use a calculator or computer to determine how many years it will take 
to pay off the mortgage. 


Find a recurrence relation to count the number of n-digit binary sequences 
with at least one instance of consecutive Os. 


Suppose a coin is flipped until two heads appear (the two heads need not 
be consecutive), and then the experiment stops. Find a recurrence relation 
for the number of experiments that end on the nth flip or sooner. 


Find a recurrence relation for the number of n-digit quaternary (0, 1, 2, 3) 

sequences with at least one 1 and the first 1 occurring before the first 0 

(possibly no Os). 

Find a recurrence relation for the number of n-digit ternary (0, 1, 2) 

sequences in which no 1 appears anywhere to the right of any 2. 

Find a recurrence relation for the number of n-digit ternary sequences that 

have the pattern ‘‘012’’ occurring for the first time at the end of the 

sequence. 

A switching game has n switches, all initially in the OFF position. In order 

to be able to flip the ith switch, the (i — 1)st switch must be ON and all 

earlier switches OFF. The first switch can always be flipped. Find a recur- 

rence relation for the total number of times the n switches must be flipped 

to get the nth switch ON and all others OFF. 

(a) Let T be a binary tree with all leaves at level n. Find a recurrence 
relation for the number of leaves in T. Repeat for T an m-ary tree. 

(b) Let T be a binary tree with all leaves at level n. Find a recurrence 
relation for the total number of vertices in T. Repeat for T an m-ary 
tree. 


Find a recurrence relation for the number of ways to pair off 2n people 
for tennis matches. 
Determine the chromatic polynomial for the following graphs: 


(a) (b) (c) 


Find a recurrence relation for the number of ways to pick k objects with 
repetition from n types. 

Find a recurrence relation for the number of ways to select n objects from 
k types with at most three of any one type. 

Find a recurrence relation for the number of ways to distribute n distinct 
objects into k indistinguishable boxes with no box empty. 


Find a recurrence relation for the number of partitions of the integer n into 
k parts. 
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. Find a recurrence relation for the number a,,,,, of distributions of n identical 


objects into k distinct boxes with at most four objects in a box and with 
exactly m boxes having four objects. 


. Find a system of recurrence relations for computing the number of n-digit 


binary sequences with an even number of Os and an even number of Is. 


. Find a system of recurrence relations for computing the number of n-digit 


quaternary sequences with: 

(a) An even number of 0s. 

(b) An even total number of Os and 1s. 

(c) An even number of Os and an even number of Is. 


. Find a system of recurrence relations for computing the number of n-digit 


binary sequences with exactly one pair of consecutive Os. 


. Find a system of recurrence relations for the number of n-digit binary 


sequences with k adjacent pairs of 1s and no adjacent pairs of Os. 


. Find a system of recurrence relations for computing the number of ways 


to hand out a penny or a nickel or a dime on successive days until n cents 
are given such that the same amount of money is not handed out on two 
consecutive days. 

Find Aa, and Aa, if a, equals: 

(a) 3n + 2 (b) r° (ce) 


. If the half-life of the element combinatorium is one year (it gives off half 


as much radiation each successive year), give a difference equation for c,,, 
the radiation in the nth year. 


. Let f, be the amount of food that can be bought with n dollars. Let p, be 


the ‘‘perceived’’ value of the $n of food. Suppose the increase in perceived 


- value with one dollar more of food equals the relative, or percentage, 


increase in the actual amount of food. Find a difference equation relating 
Ap, and Af,. 


. (a) Find a recurrence relation for the number of permutations of the first 


n integers such that each integer differs by one (except for the first 
integer) from some integer to the left of it in the permutation. What 
is the initial condition? 

(b) Solve the relation in part (a) by guessing and verifying the guess by 
induction. 


(c) Give a direct combinatorial answer to this problem. 


. Find a recurrence relation for the number of ways to pair off 2n points on 


a circle with nonintersecting chords. (Hint: The recurrence involves prod- 
ucts of a8, ie., + + aa,-4 + + °°). 


. (a) Find a recurrence relation for f(n, k), the number of k subsets of the 


integers 1 through n with no pair of consecutive integers. 
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42. 


43. 


44. 


45. 


46 


47. 


49, 


(b) Show that 22% f(n, k) = Fis, the (n + 1)st Fibonacci number (n 
even), where Fy) = F, = 1. 


Find a system of recurrence relations for computing the number of ways 

to seat n couples around an (unoriented) round table with no pair of spouses 

adjacent. 

Find a system of recurrence relations for computing a,, the number of 

(unordered) collections of (identical) pennies, (identical) nickels, (identical) 

dimes, and (identical) quarters whose value is n cents. 

Find a recurrence relation for the number of ways a coin can be flipped 

2n times with n heads and n tails and: 

(a) The number of heads at any time never being less than the number of 
tails. 

(b) The number of heads equaling the number of tails only after all 2n 
flips. 

Find a recurrence relation for the number of incongruent integral-sided 

triangles whose perimeter is 7 (the relation is different for n odd and n 

even). 

Find a system of recurrence relations for computing the number of spanning 

trees in the ‘‘ladder’’ graph with 2n vertices. 


Verify the following identities for Fibonacci numbers (F; is the ith Fibonacci 
number) by induction or combinatorial argument. Here Fy = F, = 1. 
(a) DF: = Fra 1 (b) > Fi = FeF on 
(©) Dd Fa = Foun (d) F,Fni2 = Fass + (-1" 
(6) fy at B= ey PS SP 
. (a) Show that Frin = FoF, + F-iFn-1- 


(b) From part (a) conclude that F,,.,; divides Fyn-1. 


A set of cards is numbered from 1 to n. If the top card is k, we invert the 
order of the top & cards in the pile and again look at the new top card and 
invert again, and so on. Show that the maximum number of rounds until 
1 is at the top is F,,,. (Hint: Induct on the number & of the top card.) 
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7.2 DIVIDE-AND-CONQUER RELATIONS 


In this section we present a special class of recurrence relations that arise 
frequently in the analysis of recursive computer algorithms. These are algo- 
rithms that use a ‘‘divide-and-conquer’’ approach to recursively split a problem 
into two subproblems of half the size. The dictionary search in Example 3 of 
Section 3.1 is such an algorithm. A binary tree is explicitly or implicitly 
associated with most ‘‘divide-and-conquer’’ algorithms. 

The total number of steps a, required by a divide-and-conquer algorithm 
to process an n-element problem frequently satisfies a recurrence relation of 
the form 


An = CAnn + f(n) (1) 


The following table indicates the form of the solution of Eq. (1) for some 
common values of c, d, and fin r] denots the smallest integer m with m = 


r): 


c f(n) a, 
c=1 d_— dllognl+A 


c=2 4d An—d 


c>2 dn Arie +-( 2d )p 
26 


c=2 adn dn(log,n + A) 


The constant A is to be chosen to fit the initial condition. 

If a problem is recursively split into k parts instead of two parts, then one 
should replace 2 by k everywhere in the foregoing table, except the solution 
for c = k and f(n) = d becomes An — d/(k — 1). For example, the recurrence 
relation 


a, = Cay, + dn c#R 
has the solution 


a, = Anlst + i )p (2) 


c 


The solutions for a, given in the foregoing table are easily verified by 
substitution. Consider the case: a, = Cay. + dn, c # 2. Substituting the table’s 
solution of a, = An'®° + [2d/(2 — c)]n into ca,p + dn, we have 


n \ #2 2d \n 
Cann + dn = c{a() + 7c)? + dn 


_ cAni% | cdn a: (2 —)dn 
Dee 2 2-e¢ 
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logyc _ 
= chris edn += Olt ayine 4 (24 = a, 
c 2:6 2-c¢ 


The following examples illustrate such ‘‘divide-and-conquer’’ recurrences 
and their solution. 


Example 1 


In a tennis tournament, each entrant plays a match in the first round. Next, all 
winners from the first round play a second-round match. Winners continue to 
move on to the next round, until finally only one player is left—the tournament 
winner. Assuming that tournaments always involve n = 2* players, for some 
k, find and solve a recurrence relation for the number of rounds in a tournament 
of n players. 

In terms of binary trees, a, is the height of a balanced binary tree with 
n = 2* leaves. Since half the players are eliminated in each round, the number 
of rounds increases by 1 when the number of players doubles. The recurrence 
relation for a,, the number of rounds, is thus 


a, = Ayn + 1 


From the foregoing table, the solution of this recurrence relation is a, = 
log,n + A. To determine A, we observe that 0 = a; = lol +A =O+A, 
and so A = 0. m= 


Example 2 


Build a recurrence relation model to count the number of comparisons that 
must be made in the following algorithm for finding the largest number / and 
the smallest number s ina set S of n distinct integers. Then solve the recurrence 
relation obtained. 

Initially suppose that n is an even number. Assume that we have already 
found /, and s,, the largest and smallest numbers, respectively, in the first half 
of S (the first n/2 numbers) and have found /, and s, the largest and smallest 
numbers in the second half of S. Then make two comparisons, one between /, 
and /, and the other between s, and s, to find the largest number / and smallest 
number s in S. 

The associated recurrence relation for the number of comparisons in this 
procedure is a, = 2a,. + 2, for n = 4 and even. If n is odd and we split S 
almost equally, the relation would be a, = da+ye + @-ye + 2. Observe that 
a, = 0, since the one number is both largest and smallest. And a, = 1, since 
we can determine the larger and smaller number in a two-element set with one 
comparison. With these two relations along with a, and a2, we can recursively 
determine the number of comparisons needed for any n. 

Next we solve the recurrence relation a, = 2a,. + 2,n 2 4, with a, = 
1. The foregoing table tells us that the solution will be of the form a, = 
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An — 2. We confirm this by substituting a, = An — 2 into both sides of the 
relation a, = 2a,,. + 2: 


An ~ 2 = a, = ayn +2 = YAS — 2) +2 


=An—-4+2=An-2 
We can use the initial condition a, = 1 to determine A: 


1=a,=A,(2)-2 or A= 
So a, = 3n — 2 is the number of comparisons needed to find the largest and 
smallest number according to the procedure given previously. (It is possible to 
prove that one cannot do better than 3n — 2 comparisons.) m= 


Example 3 


Normally one must do n? digit-times-digit multiplications to multiply two n- 
digit numbers. Use a divide-and-conquer approach to develop a faster algorithm. 

Let us initially assume that n is a power of 2. Let the two n-digit numbers 
be g and h. We split each of these numbers into two n/2-digit parts: 


g = 2,10" + g, h = h10"? + hy 
Then 
g X h = (g; X A,)10" + (g) X Ay + Bo X WyY)10"" +g. X hy ~~ (2) 
Observe that 
BX hy + BX hy = (81 + 82) X (hy, + An) — 2) X hy — 22 X Ig 


and so we need to make only three n/2-digit multiplications, g; X h;, 22 X ho, 
and (g, + g2) X (hf, + h)) to determine g X h in Eq. (2) (actually (g; + g2) 
or (h, + h2) may be (n/2 + 1)-digit numbers, but this slight variation does not 
affect the general magnitude of our solution). If a, represents the number of 
digit-times-digit multiplications needed to multiply two n-digit numbers by the 
foregoing procedure, then the procedure yields the recurrence a, = 3a,/. 

By the third line of the table at the start of this section (where d = 0), a, 
is proportional to n'°%? = n'°, a substantial improvement over n7. = 


In some settings, we are not so interested in an exact formula for a, as we 
are in the general rate of growth for a,. The following theorem from Cormen, 
Leiserson, and Rivest [2] gives bounds on such growth: 


Theorem 


Let a, = ca,, + f(n) be a recurrence relation with positive constant c and the 
positive function f(n). 
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(a) If for large n, f(n) grows proportional to n'° [that is, there are positive 
constants p and p’ such that pn < f(n) = p'n"*], then a, grows proportional 
to n%log,c. 

(b) If for large n, f(7) = pn’, where p is a positive constant and q < log,c, 
then a,, grows at most at a rate proportional to n°, 


7.2 EXERCISES 


Mb 1. Solve the following recurrence relations (leaving a constant A to be deter- 


mined): 
(a) a, = 2a,p + 5 (b) a, = 2anj4 +n 
(C) Gy = Any + 2n — 1 (d) a, = 3a,3 + 4 


2. Find and solve a recurrence relation for the number of matches played in 
a tournament with n players, where n is a power of 2. 


3. Ina large corporation with n salespeople, every 10 salespeople report to 
a local manager, every 10 local managers report to a district manager, and 
so forth until finally 10 vice-presidents report to the firm’s president. If 
the firm has n salespeople, where n is a power of 10, find and solve 
recurrence relations for: 


(a) The number of different managerial levels in the firm. 
(b) The number of managers (up through president) in the firm. 


4. Ina tennis tournament, each player wins k hundreds of dollars, where k is 
the number of people in the subtournament won by the player (the subsection 
of the tournament including the player, the player’s victims, and their 
victims, and so forth; a player who loses in the first round gets $100). If 
the tournament has 7 contestants, where n is a power of 2, find and solve 
a recurrence relation for the total prize money in the tournament. 


5. Consider the following method for rearranging the n distinct numbers x,, 
X2,...,X, in order of increasing size (n is a power of 2). Pair the integers 
off {x1, x2}, {x3, x4}, and so forth. Compare each pair and put the smaller 
number first. Next pair off the pairs into sets of four numbers and merge 
the ordered pairs to get ordered four-tuples. Continue this process until the 
whole set is ordered. Find and solve a recurrence relation for the total 
number of comparisons required to rearrange n distinct numbers. (Hint: 
First find the number of comparisons needed to merge two ordered k-tuples 
into an ordered 2k-tuple.) 


6. Verify by substitution the form of solution given in the text to the recurrence 
relations 
(a) an = Any +d (b) a, = 2A njr +d (c) a, = 2A nn + dn 
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7. (a) Use a divide-and-conquer approach to devise a procedure to find the 
largest number in a set of n distinct integers. 


(b) Give a recurrence relation for the number of comparisons performed 
by your procedure. 
(c) Solve the recurrence relation obtained in part (b). 


8. (a) Use a divide-and-conquer approach to devise a procedure to find the 
largest and next-to-largest numbers in a set of n distinct integers. 


(b) Give a recurrence relation for the number of comparisons performed. 
by your procedure. 


(c) Solve the recurrence relation obtained in part (b). 


7.3 SOLUTION OF LINEAR 
RECURRENCE RELATIONS 


In this section we show how to solve recurrence relations of the form 
An = C\An-| + C2An-p tt HF C,An-y (1) 


where the c;s are given constants. There is a simple technique for solving such 
relations. Readers who have studied linear differential equations with constant 
coefficients will see a great similarity between their solution and the form of 
solutions we discuss here. The general solution to Eq. (1) will involve a sum 
of individual solutions of the form a, = a”. To determine what a is, we simply 
substitute a* for a, in Eq. (1), yielding 


a" = ca"! + ca"-24+-+:+¢a"" (2) 


We can reduce the power of @ in all terms in Eq. (2) by dividing both sides 
by a’: 


a’ = car! + ca"? +ee- +e, (3) 


or equivalently 


r-1 


a’ — ca™!— coa’?—++--c,=0 (4) 


Equation (4) is called the characteristic equation of the recurrence relation 
(1). It has r roots, some of which may be complex (but we shall initially assume 
that there are no multiple roots). 

If a, Q), ..., @, are the r roots of Eq. (4), then, for any 1,0 =i <r, 
a, = a" is a solution to the recurrence relation (1). It is easy to check that any 


€ 
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linear combination of such solutions is also a solution (Exercise 8). That is, 


a, = Aaj + Asai t+ ++ +A,a? (5) 
is a solution to Eq. (1), for any choice of constants A;, 1 = i =r. 

Recall that for a recurrence relation involving a,_), @,-», ..., G,-, on the 
right side, we need to be given the initial conditions of the first r values 
ao, a, a, ..., @,-;. Let us denote such a set of initial values by 
aj, @|,43,...,4a;-;. Then the A;s must be chosen to satisfy the r constraints: 

Ajyak + Aakt+-+++A,atb =a} O<ksr-1 (6) 


The r linear equations in Eq. (6) can be solved by Gaussian elimination to 
determine the r constants A; (remember that at this stage the a,;s are known). 
With the A;s determined, we will have the desired solution for a,, a solution 
that satisfies Eq. (4), and hence the recurrence relation (1), and satisfies the 
initial conditions ay = aj,a,=a{, ..., a,-; = aj_,. If the solution of the 
characteristic equation (4) has a root a,, of multiplicity m, then ay, no, na", 
..., 2"Ya@" can be shown to be the m associated individual solutions to be 
used in Eqs. (5) and (6). 


Example 1 


Every year Dr. Finch’s rabbit population doubles. He started with six rabbits. 
How many rabbits does he have after eight years? After n years? 

If a, is the number of rabbits, then a, satisfies the relation a, = 2a,_;. We 
are also given ay) = 6. Substituting a, = a", we obtain a” = 2a""' or, dividing 
by a""', a = 2. Soa, = 2" is the one individual solution, and a, = A2" is the 
general solution. The initial condition is 6 = a) = A2°, or A = 6. The desired 
solution is then a, = 6 X 2". After 8 years, we have a; = 6 X 28 = 6 X 
256 = 1536 rabbits. = 


Example 2 


Solve the recurrence relation a, = 2a,., + 3a,-. with ay) = a; = 1. 
Setting a, = a", we get the characteristic equation: 


a" = 2a"! 4+ 3a"? 
which yields 
a’? = 2a + 3 


This may be written a? — 2a — 3 = 0 or (a — 3a + 1) = 0. That is, the 
roots are +3 and —1. So the basic solutions of the recurrence relation are 
a, = 3” and a, = (—1)", and the general solution is 


a, = A,;3" + Ax-1)" 
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Now we determine A, and A, by using the initial conditions: 
l= ay = A,3° + Ax(—-1)° = A, + A, 
1= a, = A,3! + A,(-1)! a 3A, Pom A2 


We solve these two simultaneous equations to obtain A, = 3, Ar = 3 (add the 
two equations together to eliminate A, yielding 2 = 4A, or A, = 3, and then 
determine A,). The required solution to the recurrence relation with the given 
initial conditions is 


a, =4}X3"+4X(-1)" 0 


Example 3 


Find a formula for the number of ways for the elf in Example 2 of Section 7.1 
to climb the n stairs. 

The recurrence relation obtained in Example 2 of Section 7.1 was the 
Fibonacci relation a, = @,-; + @,-2, with the initial conditions a, = 1, a, = 
2. The associated characteristic equation is obtained by setting a, = a”: 


at= a™! + an? 
which reduces to 
w=atl or a®—-a-1=0 


Using the quadratic formula, we get 


VEIF= DCD = 5 + 
a= a (-1) + V1 — 40) D1 = = (1 & V5) 


That is, we have roots 3 + 3 1/5 5 and 4 — 3 V5, and the general solution of the 


problem is 
il i 
a,=Ai(5+55) +A (5-45) 


Determining A, and A, is left as an exercise. We note the surprising fact 
that to generate the Fibonacci sequence of integers 1, 1, 2, 3, 5, 8, 13,..., we 


need powers of (3 += +V5) and (3-5 == V5). 2 


The following example illustrates a solution of a recurrence relation that 
has complex and multiple roots. 


Example 4 


Find a formula for a, satisfying the relation a, = —2a,-. — d,-4 with a) = 0, 
a, = 1, a, = 2, and a; = ’ 
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Substituting a, = a”, we obtain a" = —2a"? — 3a""4, which yields the 
characteristic equation a* + 2a? + 1 = (a? + 1) = 0. The roots of this 
equation are a = +i and a@ = —i (where i = V-—1) and each root has 


multiplicity 2. Recall that when @ is a double root, the associated recurrent 
relation solutions are a, = a” and a, = na”. So the general solution is 


a, = Aji" + Agni” + A,(—i)" + Agn(—iy" 
The initial conditions yield the equations 
0 =a) =A,i° + A,0i° + Ax(—i)° + A,O(—iY 
=A,+0+A;,+0 
1 =a,=A,i' + A,li'+ A x(—i)! + Agl(—i)! 
= 1(A,; + A, — A3 — Aa) 
2 = ay = Aji? + A,2i* + Ax(—i)? + Ay2(-i? 
= —A, — 2A,—A;— 2A, 
3 =a;=A,i3+ A313 + Ax(—i)? + A,3(-i)3 
= i(—A, — 3A, + A; + 3A,) 


| 


Solving these four simultaneous equations in four unknown A;s, we obtain 


33 hiss 3. 1 
eS ne ae Baal ot alae. 


Then the solution of the recurrence relation is 
Lee We Nee Bes 1. d 
= ~—ayn + aw oe n pay aad n + pias ae = n 
a, = 7! ( 5 + i)ni + ai i) ( , i) i)" = 


We remind the reader that for specific values of n, such as n = 12, it is 
easier to determine a, in the two preceding examples by recursively calculating 
3, Q4, As Up to a). from the recurrence relation than to solve the initial-condition 
equations. 


7.3 EXERCISES 


Mh 1, If $500 is invested in a savings account earning 8 percent a year, give a 


formula for the amount of money in the account after n years. 


2. Find and solve a recurrence relation for the number of n-digit ternary 
sequences with no consecutive digits being equal. 


3. Solve the following recurrence relations: 
(a) a, = 34,-1 So 4a,-2, ay = ay = 1 
(b) ay, = G,-2, Ag = A, = 1 
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(c) a, = 2a,-; — ay-2, ay = a, = 2 
(d) a, = 3a,-1 — 3aq-2 + Gn-3, 4) = a; = 1a, =2 


4, Determine the constants A, and A; in Example 3. First show that the initial 
conditions a, = 1, a, = 2 are equivalent to the initial conditions ay = 1, 
a, = 1. 

5. Find and solve a recurrence relation for the number of ways to arrange 
flags on an n-foot flagpole using three types of flags: red flags 2 feet high, 
yellow flags 1 foot high, and blue flags 1 foot high. 

6. Find and solve a recurrence relation for the number of ways to make a 
pile of n chips using red, white, and blue chips and such that no two red 
chips are together. 


7. Find and solve a recurrence relation for P,, the value of a stock market 
indicator that obeys the rule that the change this year (from the previous 
year) equals twice last year’s change. Suppose py) = 1, p; = 4. 

8. Show that any linear combination of solutions to Eq. (1) is itself a solution 
to Eq. (1). 

9. Show that if the characteristic equation (4) has a root a, of multiplicity 
3, then n/a, for j = 0, 1, 2, are solutions of Eq. (1). 

10. Show that if F,, is the mth Fibonacci number in the Fibonacci sequence 
starting Fy = F, = 1, then 
lim Fst = 1 + V5 
nao F, 2 
11. If the recurrence relation a, = c\a,-, + C2a,-2 has a general solution a, = 
A,3" oF A26", find (on and C2. 


—————____y 


7.4 SOLUTION OF INHOMOGENEOUS 
RECURRENCE RELATIONS 


A recurrence relation is called homogeneous if all the terms of the relation 
involve some a,. In the preceding section, we presented a method for solving 
any homogeneous recurrence relation of the form a, = c,a,-, + Cr7An-y +t 
+ c,a,-,. When an additional term involving a constant or function of n appears 
in the recurrence relation, such as 


a, = Ca,-) + f(n) - (1) 


where c is a constant and f(m) is a function of , then the recurrence relation 
is said to be inhomogeneous. 
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In this section, we discuss methods for solving inhomogeneous recurrence 
relations of the form of Eq. (1). The key idea in solving these relations is that 
a general solution for an inhomogeneous relation is made up of a general 
solution to the associated homogeneous relation [obtained by deleting the f() 
term] plus any one particular solution to the inhomogeneous relation. 

For Eq. (1) the homogeneous relation is a, = ca,-;, whose general solution 
is a, = Ac". Suppose a* is some particular solution of Eq. (1); that is, a* = 
cax., + f(n). Then we see that a, = Ac” + a* satisfies Eq. (1): 


a, = Ac" + a*¥ 
= Ac" + {ca*, + f(n)] 
c(Ac"! + a*®,) + f(n) = ca,-; + f(n) 


The constant A in the general solution is chosen to satisfy the initial condition, 
as in the previous section (A cannot be determined until a* is found). 

There is one special case of Eq. (1) that we can restate as an enumeration 
problem treated in previous chapters. If c = 1, then Eq. (1) becomes 


a, = An-| + f(a) (2) 
We can iterate Eq. (2) to get 


a, =a t+ f(t) 
a, = a, + f(2) = [a + f(1)] + £2) 
a3 = a, + f(3) = [ao + f(L) + f(2)] + f3) 


A, = ay + f(1) + f(2) +f) +++ +f (0) = ay + df) 


So a, is just the sum of the f(k)s plus ao. In Sections 5.5 and 6.5 we presented 
methods for summing functions of n. Either method can be used to solve 


Eq. (2). 


Example 1 


Solve the recurrence relation a, = a,-, + n with initial condition a, = 2, 
obtained in Example 3 of Section 7.1, for the number of regions created by n 
mutually intersecting lines. 

The initial condition of a; = 2 can be replaced by the initial condition 
a = 1 (no lines means that the plane is one big region). By the foregoing 
discussion, we see that a, = 1 + (1 +2 +3 +.-+-+-+ n). The expression to 
be summed can be written as 


()eQeQeQ) a 
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+ 
By identity (7) of Section 5.5, this sum equals (" > ' = sn + 1). Then 


a, = 1 + Sain +. 2 


When c # 1 in Eq. (1), there are known solutions to Eq. (1) to use for specific 
functions f(n), similar to the situation for ‘‘divide-and conquer’’ recurrences 
presented in Section 7.2. We present a table for the simplest f(n)s. These 
solutions can be derived by generating function methods introduced in the next 
section. 


f(n) Particular solution p(n) 
d, a constant B 

dn Bin + By 

dn? Bn? + Bin + By 

ed" Bd" 


The Bs are constants to be determined. If f(n) were a sum of several different 
terms, we would separately solve the relation for each separate f(n) term and 
then add these solutions together to get a particular solution for the compos- 
ite f(n). 

There is one case in which the particular solution for S(”) = ed" will not 
work. This involves the relation a, = da,.,; + ed"—note here that the homoge- 
neous solution a, = Ad" has the same form as f(n). Then one must try ax = 
Bnd" as the particular solution. 


Example 2 


Solve the recurrence relation a, = 2a,-, + 1 with a, = | obtained in Example 
4 of Section 7.1 for the number of moves required to play the n-ring Tower 
of Hanoi puzzle. The general solution to the homogeneous equation a, = 2a,-; 
is a, = A2". 

We find a particular solution to the inhomogeneous relation by setting 
a; = B [this is the form of a particular solution given in the foregoing table 
when f(7) is a constant]. Substituting in the relation, we have 


B=a* =2a*,+1=2B+1 or B=-1 


So a; = —1 is the particular solution, and the general inhomogeneous solution 
is a, = A2" + a¥ = A2" — 1. We now can determine A from the initial 
condition: 1 = a, = A2' — 1, or 1 = 2A — 1. Hence A = 1, and the desired 
solution is a, = 2” — 1. = 
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Example 3 


Solve the recurrence relation a, = 3a,-, — 4n + 3 X 2" to find its general 
solution. Also find the solution when a, = 8. 

The general solution to the homogeneous equation a, = 3a,-; is a, = 
A3". We solve for a particular solution of the relation separately for each 


inhomogeneous term. For a, = 3a,-1 — 4n, we try the form a* = Bin + Bo, 
obtaining 
Bin + By = a® = 3a%, — 4n = 3[B,\(n — 1) + By] — 4n. (4) 


We now equate the constant terms and the coefficients of n on each side of 
Eq. (4): 
Constant terms: By = —3B, + 3Bo (5) 
n terms: Bin = 3B\n — 4n or B, = 3B,-4 (6) 
Solving for B, in Eq. (6), we obtain B, = 2. Substituting B, = 2 in Eq. 
(5), we obtain By = 3. So a¥ = 2n + 3 is a particular solution of a, = 
3ay-| — An. 
Next for a, = 3a,-; + 3 X 2", we try a7 = B2", obtaining 


B2" = at = 3a7_, + 3 & 2" = 3(B2"') + 3 x 2 (7) 
Dividing both sides of (7) by 2”', we get 2B = 3B + 6, or B = —6. So 
a; = —6 X 2" is a particular solution of a, = 3a,-; + 3 X 2". Combining our 


particular solutions with the general homogeneous solution, we obtain the 
general inhomogeneous solution 


a, = A3" + 2n +3 - 6X 2" 
When a, = 8, we can determine A: 
8 =a, =A3'+ 21) +3 —-6%X2! 
=3A-7 
Thus A = 5, and the solution is 5 X 3° + 2n + 3 — 6 X 2". = 


7.4 EXERCISES 


Mb 1. Solve the following recurrence relations: 


(a) a, = a,-, + 3(n — 1), a = 1 
(b) a, = a,-; + n(n — 1), a = 3 
(c) dy = Gy) + 3’, ayo = 10 
2. Find and solve a recurrence relation for the number of infinite regions 


formed by 7 infinite lines drawn in the plane so that each pair of lines 
intersects at a different point. 
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- (a) Find and solve a recurrence relation for the number of different square 


subboards of any size that can be drawn on an n X n chessboard. 
(b) Repeat part (a) for rectangular subboards of any size. 


. Find and solve a recurrence relation for the number of different regions 


formed when nv mutually intersecting planes are drawn in three-dimensional 
space such that no four planes intersect at a common point and no two 
planes have parallel intersection lines in a third plane. [Hint: Reduce to a 
two-dimensional problem (Example 1).] 


. Find and solve a recurrence relation for the number of regions into which 


a convex n-gon is divided by all its diagonals, assuming no three diagonals 
intersect at a common point. (Hint: Sum the inhomogeneous term using a 
special case of an identity from Section 5.5.) 


. If the average of two successive years’ production $(a, + d,-1) is 2n + 5 


and a, = 3, find a,,. 


- Solve the recurrence relation a, = 1.08a,-; + 100, a) = 0, from part (b) 


of Example 5 in Section 7.1. 


. Suppose a savings account earns 5 percent a year. Initially there is $1000 


in the account, and in year k, $10k are withdrawn. How much money is 
in the account at the end of n years if: 


(a) Annual withdrawal is at year’s end? 
(b) Withdrawal is at start of year? 


- Solve the following recurrence relations: 
(a) a, = 3a,-; — 2, a) = 0 (c) a, = 2a,-; + n, a = 1 
(b) a, = 2a,-; + (—1)", a = 2 (d) a, = 2a,-; + 2n?, ay = 3 

. Solve the recurrence relation a, = 3a,-; + n’? — 3, with a = 1. 

- Solve the recurrence relation a, = 3a,-, — 2a,-2 + 3, a) = a, = 1. 

. Find and solve a recurrence relation for the number of n-digit ternary 
sequences in which no 1 appears to the right of any 2. 

. Find and solve a recurrence relation for the earnings of a company when 


the rate of increase of earnings increases by $10 X 2" in the kth year from 
the previous year, where a) = 20 and a, = 1020. 


- Show that the general solution to any inhomogeneous linear recurrence 


relation is the general solution to the associated homogeneous relation plus 
one particular inhomogeneous solution. 


- Show that the form of the particular solution of Eq. (1) given in the table 


in this section is correct for: 
(a) f(2) = d (b) f(n) = dn 
(ce) f(n) = dr’ (d) f(n) = a” 
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16. Show that if f(7) in Eq. (1) is the sum of several different terms, a particular 
solution for this f(72) may be obtained by summing particular solutions for 
the individual terms. 


17. Find a general solution to a, — Sa,-, + 6a,-2 = 2 + 3n. 


18. If the recurrence relation a, — Ci@,-; + C2Q,-.2 = c3n + Cy has a general 
solution a, = A,;2" + A,5" + 3n — 5, find c,, C), C3, C4. 


19. Solve the following recurrence relations when a) = 1: 
(a) a2 = 2a?_, + 1 (Hint: Let b, = a?.) 
(b) a, = —na,_, + n! (Hint: Define an appropriate b, as in part (a).] 


7.5 SOLUTIONS WITH GENERATING 
FUNCTIONS 


Most recurrence relations for a, can be converted into an equation involving 
the generating function g(x) = ay + ayx + +++ + a,x" + +++. This associated 
functional equation for g(x) can often be solved algebraically and the resulting 
expression for g(x) expanded in a power series to obtain a, as the coefficient 
of x". Some of the algebraic manipulations of the functional equations may be 
new to the reader. 

We will treat g(x) as if it were a standard single variable, such as y, and 
treat other functions of x as constants. For example, the functional equation 
g(x) = x’g(x) — 2x can be solved by rewriting the equation as g(x)(1 — x?) = 
—2x and hence g(x) = —2x(1 — x’)"'. Similarly, the functional equation 


(1 — x°\Lg@)P — 4xg(x) + 4x7 =0 () 


can be solved by the quadratic formula that we normally apply to equations 
such as ay* + by + c = 0. Nowa = (1 — x’), b = —4x, and c = 4x’. 
Intuitively, for each particular value of x, g(x) is the solution of Eq. (1). Thus, 
by the quadratic formula, the solution to Eq. (1) is 


! 1G a ERT GT 1 
g(x) = Wd — x) [4x = V 16x? — 16x7(1 = x’)] = 0d = xy + 4x2) 


So g(x) = 2 + x*)/(1 — x’) or 2x — x*\/(1 — x’). If there are two (or more) 
possible solutions, only one will normally make sense as a generating function 
for a, (e.g., have a power series expansion with the correct value for the initial 
condition ap). 

Now let us show by example how recurrence relations for a, can be con- 
verted into functional equations for an associated generating function. 


7.5 Solutions With Generating Functions 307 


Example 1 


Find a functional equation for g(x) = a) + ax +--+: + a,x" + +++ where 
a, satisfies the recurrence relation a, = a,_; + n, when n = 1, obtained in 
Example 3 of Section 7.1. The initial condition was a) = 1. Solve the functional 
equation and expand g(x) to find a,. 

Using this recurrence relation for every term in g(x) except ao, we have 
a,X" = a,x" + nx", n = 1, Summing the terms, we can write 


B() — ay = Saget = S. (ay ix" + ne" (2) 
aSoareS (ew 
= 28(0) + 7 (5) 


Line (3) is obtained from line (2) by breaking up the sum of the two x" terms 
into sums of each term, and by rewriting @,-,x" as xa,_,x""' (in order to make 
the power of x correspond with the subscript of a,-,). Line (4) is obtained from 
line (3) by reindexing the first sum with m = n — 1, and by adding the 
‘‘phantom’’ (zero) term Ox° to the second sum and rewriting n as C(n, 1). The 
first series is the generating function g(x) multiplied by x. The second series 
has a generating function obtained by the construction presented in Section 
6.5. Equating line (5) with g(x) — ap [the left side of line (2)] and setting 
a) = 1, we have the required functional equation for g(x): 


g) —1= 8G) +7 (6) 
Solving for g(x), we rewrite Eq. (6) as 
f, x oth x 
g(x) — xe(x) = 14+ Goa or ax) -—x)=1+ (i —xy 
Thus 
1 
g(x) = * 


+ ——_——. 
(l= x)" (b= xp 
The coefficient of x” in (1 — x)” is just 1 and in x(1 — x)" is C(m— 1) + 
3-1n-DY)=Cat+1,n- 1) = C(v + 1, 2). Then 


ae x _v,,ufnt1\,_< n+1 7 
Sean Ga ee +3( 2 ): ->[1+( 2 ) fs 


n=0 n=0 
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and so a, = 1 + C(n + 1, 2)—the same answer as obtained for this recurrence 
relation in Example 1 of Section 7.4. = 


Example 2 


Use generating functions to solve the recurrence relation a, = @,-; + @,-2, with 
a, = 1, a) = 2 obtained in Example 2 of Section 7.1. 

The initial conditions, a; = 1, a, = 2 are equivalent to a = la, = 1. 
Then using the same power series summation approach as in the previous 
example, we obtain 


fos] oO 
B(x) — ay — ax = >» a,x" = > (Qy—1X" + An-2X") 
n=2 n=2 
ao wo 
=2 Ge ee Didnt? 
n=2 n=2 


= ya ae > a,x! 
m=} k=0 
= x[(g(x) — ao] + x8 (x) 
Setting a) = 1 and a, = 1, we have the functional equation 
g(x) — 1 —x=xf g(x) — t+ 22g@) or g(x) — xg(x) —x’8@) — 1 = 0 
_ and so 
gx) == x)= 1 or g(x) = 1 — x — x’) 

Observe that this denominator is closely related to the characteristic equa- 
tion of this recurrence relation, given in Example 3 of Section 7.3. Jn general, 
g(x) will have a denominator | + cx + Ox? + +++ + ¢,x" if and only if x" 
4 cyt! + ex? + +++ + ¢, is the characteristic equation of the associated 
recurrence relation, and so 1 — ax is a factor of the denominator of g(x) if 
and only if a is a root of the characteristic equation. The numerator will depend 


on the initial conditions. 
By the quadratic formula, we can factor 


l—-x—-x =[1-40 + V5)xq{ — 40 — V5)x] 


For simplicity, let us define a, = 301 + V5) and a) = 3(1 — V5). Then we 
have 


1 2 a,/V5 - anlV'5 


(1 — ayx)(1 — ax) ~t—ax 1—ax 


g(x) = (7) 

The decomposition of g(x) into two fractions in Eq. (7) is obtained by the 
method of partial fractions, the reverse process of combining two fractions into 
a common fraction. Setting y = a,x in the first fraction on the right side in 
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Eq. (7), we have 
aVS | 1 )-> oC eee 
ia V5 — py > atx 


The same type of expansion exists for y = a,x. Then a,, the coefficient of x” 
in the power series expansion of g(x), is 


i ess 1 
a, = ait! ——=as"! 
Ver 
I (: + say" 1 (: - ey" 
se ET co a 
V5\ 2 V5\ 2 
Observe that it is much easier to determine a) by recursively computing 
a, @3,... Up tO aq using the Fibonacci relation than by setting n = 10 in the 
foregoing formula for a,. 
Example 3 
Let g,(x) be a family of generating functions g,(x) = Gig + Qyyx t°+ + 
AnpX* + +++ + Ay, X" Satisfying the relation an, = Qq_-i4 + Qy-14-1, With ayo = 


Ann = 1 (and a,, = 0, k > n) obtained in Example 9 of Section 7.1. Find a 
functional relation among the g,(x)s and solve it to obtain a formula for a,,. 
Using the power series summation method, we obtain 


8x) —1= > Angx* = > (Gy—1pX" + Ay—14-1X") 
=I =I 


n n~-| 
= ay iyx' +x Oy aioe 
k=1 h=0 
= Bn-1(x) — 1 + X8n-1(X) 
Thus 


SX) = Sn-i(X) + XSn-1X) = (1 + x) gn) 
The resulting recurrence g,(x) = (1 + x)g,-;(x) is solved just like the recurrence 
a, = ca,-,. The solution is 
| go(x) = (1 + xJ"go(x) = (1 + 2) 
since go(x) = ayo = 1. Now by the binomial theorem, we have a, = C(n, k). = 
The rest of this section involves more complicated computations. The reader 


may skip this material. First we consider a nonlinear recurrence relation and 
solve it using generating functions. 
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Example 4 


Find a recurrence relation for a,, the number of ways to place parentheses to 
multiply the n numbers k, X k) X k; X kyX +++ X k, on a calculator. Solve 
for a, using generating functions. 

To clarify the problem, observe that there is one way to multiply (k; X k2); 
SO a) = 1. There are two ways to multiply k, X k, X k;, namely, [(k, * k)) X 
k,] and [k, X (k, X k3)]; so a3 = 2. It is not clear what a) and a, should be, but 
to make the eventual recurrence relation have a simple form we let a) = 0 and 
a, = 1. To find a recurrence relation for a,, we look at the last multiplication (the 
outermost parenthesis) in the product of the n numbers. This last multiplication 
involves the products of two multiplication subproblems: 


(ky X ky X +++ XK) X (Kis X king X 0 X hy) 


where i can range from | to n — 1. The numbers of ways to parenthesize the 
two respective subproblems are a; and a,-;, and so there are a;a,-; ways to 
parenthesize both subproblems. Summing over all i, we obtain the recurrence 
relation (for n = 2) 


Qn = AAn-) + A2An-2 +o + AiAn-7 Fo Ay y 
Observe that if g(x) = ay + ax + +++ + a,x" + ---, then the right-hand 
side of this equation is simply the coefficient of x", for n = 2, in the product 
g(x)g(x) = (O+ axt+++++a,x" +--+). Using the power series summation 


method, we have (recall that aj = 0 and a, = 1) 


g(x) - x= > a,x" = > (AyQn—) + QpQn-2 + °° * + a,-1))x" = [g(@x)P 


Solving this quadratic equation in g(x) as described at the start of this section, 
we obtain g(x) = (1 + VL— 4x). To make a = 0 [i.e., g(0) = 0], we want 
the solution g(x) = 3 — V1 — 4x. 

This g(x) requires a new type of generating function expansion called 
the generalized binomial theorem. The power series expansion (1 + y)? = 


(3) + (*): + (4) feet (2) +--+, where q is any real number, 


has a coefficient (*) of x" defined as 


(1) = #¢- Nga en @) 


n n! 


[The formula for this generalized coefficient (2) arises from the Taylor series 


for (1 + y)*%; see any standard calculus text.] 
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Using Eq. (8), the coefficient of x" in V1 — 4x is [we think of V1 — 4x 
as (1 + y)'?, where y = —4x]: 


(?)- aye = KBD x Hn ~ 39) 
n ny 
= 1 X1X3X5X--+ x Qn-3),, 
n} 


een 2 
aay) ©) 
The last step in Eq. (9) is obtained by multiplying certain numbers in the 


numerator by appropriately selected powers of 2 (details are left to Exercise 
6). Multiplying the final expression in Eq. (9) by —3, we obtain the coefficient 


1 
a= 1(*— 2) n=15 
n\n-1 


of x" in —3V 1 — 4x, 

The expression 3C(2n — 2, n — 1) arises in various combinatorial settings 
and is called the nth Catalan number. Next let us consider generating functions 
for simultaneous recurrence relations. 


(~4y 


Example 5 


Use generating functions to solve the set of simultaneous recurrence relations 
obtained in Example 11 of Section 7.1. 


= = -1 — a | 
Qn = An-1 + By-1 + Cn-1) b, = 3" ~ Cn-ts Cy, = 3” = Dias 
Qq = b, = 0 = 1 


Let A(x), B(x), and C(x) be the generating functions for a,, b,, and Cn, Tespec- 
tively. We use the power series summation method to obtain 


io] wo fo) ioe) 
A(x) — a = > a,x" = > Qy-\x" + > by-\x" + Sy Cy-1x" 
n=l n=] n=1 n=1 


= xD gx x > byx™ +x > Cmx™ 
= xA(x) + xB(x) + xC(x) 
Bix) — bb = > b,x" = > Bn lyr — > reer 
= x(1 — 3x)7! — xC(x) 
CQ) -— co = > CnX" = > Br iy — > b,x" 
= x(1 — 3x)! — xB(x) 
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It is always desirable to state initial conditions in terms of ap, bo, co. Solving 
our three recurrence relations for a, bo, Co given a, = b, = c, = 1, we get 1 
= b, = 3° — cy or Cy = O. Similarly we find that b) = 0 and a) = 1. Then our 
functional equations are 


A(x) — 1 = xA(x) + xB(x) + xC(x) 


or 
A(x) = i L {xB(x) + xC(x) + 1] (10) 
and 
x 
B(x) = l-3x xC(x) (11) 
C(x) = Er — xB(x) (12) 


We can solve Eqs. (11) and (12) for B(x) and C(x) simultaneously. Multi- 
plying Eq. (12) by x and using this expression for xC(x) in Eq. (11), we have 


x 


= x — = ot = — 
B(x) = ae xC(x) ia oF Gy 2809 | 


= B(x)(1 — x’) = ce a 


G=x)x x __4 + 
d—x)\1-3x) (+x)1-3x) 1-3x 1l+x 


=> B(x) = 


The last step is again a partial fraction decomposition. The coefficient of x” in 
41 — 3x)! is 33" and in —3(1 + x)! is —3(—1)’. So b,, the coefficient of x” 
in B(x), is 4[3"” — (—1)"]. Equations (11) and (12) are symmetric with respect 
to B(x) and C(x), and so C(x) = B(x) and c, = b, = 4[3" — (—1)’]. 

Next we solve for A(x) in Eq. (10): 


: [xB(x) + xC(x) + 1] = a 


A(x) = B(x) + at 
1-x 


1-x 
[since B(x) = C(x)] 


ee a eee ee 
1—x\1l-—3x 1+x 1-x 
bk } 


(l-x(i—3x) 1—-x?' 1-x 


1 1 
4 3X 1 
= 7 + 
Ins i) 1-x? 1-x 


Ale 
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The coefficient of x" in #1 — 3x)! is 8", in — 40 — x)! is —}, in 


- 7 (1 — x°y"' is —2, n odd, or 0, even, and in (1 — x)"! is 1. Collecting 


these terms, we get a, = 3(3" + 3), n even, and = 4(3" + 1), n odd. » 


7.5 EXERCISES 
M 1. Find functional equations for the generating functions whose coefficients 


satisfy the relations: 
(a) a, = a,-; + 2, a = 1 
(b) a, = 3a,-; — 2a,-. + 2, a) =a, = 1 
(c) a, = a), + n(n — 1), ay = 1 
(d) a, = 2a,-; + 2", a) = 1 
2. Solve the recurrence relations in Exercise 1 using generating functions. 


3. Find functional equations for the generating functions whose coefficients 
satisfy the relations: 


n~1 
(a) a, = >) aa,1-; (1 = 1), a) = 1 
7=0 
n-2 
(b) a, = > Ga,-; (n 2 3),a =a =a =1 
i=2 


n-l 
(c) a, = > 2a, -; (n = 2), a= a= 1 
i=] 


4. Find a functional equation and solve it for the sequence of generating 
functions F(x) = > a,,.x* whose coefficients satisfy (assume Fy = 1 and 
ano = 1): _ 

(8) Gag = Ang. — 24n-1k-1 
(b) any = 20n-14 — 3Anj-1 


5. Verify the form of particular solutions to inhomogeneous recurrence rela- 
tions in the table in Section 7.4. 


6. Verify in Eq. (9) that 


SEXIX3 XS KK On= 3). 2 2n — 2 
n! n\n-—-1 


7. Find a recurrence relation and solve it with generating functions for the 
number of ways to divide an n-gon into triangles with noncrossing diago- 
nals. (Hint: Use reasoning similar to Example 4.) 
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8. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


Find a recurrence relation and associated generating function for the number 
of n-digit ternary sequences that have the pattern ‘‘012’’ occurring for the 
first time at the end of the sequence. 


. Find a recurrence relation and associated generating function for the number 


of different binary trees with 7 leaves. 

Find a recurrence relation for a,,, the number of k-subsets of an n set with 
repetition. Find an equation for F(x) = pa a,.x*, and solve for F,(x) and 
One 


Let a,, be the probability that k successes occur in an experiment with n 
trials if each trial has probability p of success. Find a recurrence relation 
for a,,. Use this relation to find and solve an equation for F(x) = 


n 
> agx" 
k=0 


(a) Find a recurrence relation for a,,, the number of k-permutations of n 
elements. 


(b) Show that F,(x) = > a,,x* satisfies the differential equation [F}_:(x) 
k=0 
denotes the derivative of F,,-,(x)] 


F(x) = (1 + x) Fa) + °F) 


(c) Find a functional equation for G,(x) = >) an4x"/k!. 
k=0 


Find and solve a system of recurrence relations allowing one to determine 
the number of n-digit quaternary sequences with an odd number of 1s and 
an odd number of 2s. 


Find and solve simultaneous recurrence relations for determining the num- 
ber of n-digit ternary sequences whose sum of digits is a multiple of 3. 


(a) Find a recurrence relation for a,, the number of ways to partition n 
distinct objects among n indistinguishable boxes (some boxes may be 


empty). 
(b) Let g(x) = » a,x"/n!, where a) = 1. Show that g(x) satisfies the 
n=0 


differential equation g’(x) = g(x)e*. Solve this equation for g(x). 


(a) Define s,, as numbers such that » i SaaS DES Deere = 
r=0 


r + 1). Find a recurrence relation for s,,,. 
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(b) Find a differential equation for F,(x) = >* s,,x'/r! 
r=0 


7.6 SUMMARY AND REFERENCES 


In this chapter we saw that recurrence relations are one of the simplest ways 
to solve counting problems. Without fully understanding the combinatorial 
process, as was required in Chapter 5, we now need only express a given 
problem for n objects in terms of the problem posed for fewer numbers of 
objects. Once a recurrence relation has been found, then starting with a, (the 
solution for one object), we can successively compute the solutions for 2, 3, 
.-. Up to any (moderate) vaiue of n. Or we can try one of the techniques in 
the later sections of this chapter to solve the recurrence relation explicitly. 

The first recurrence relation in mathematical writings was the Fibonacci 
relation. In his work Liber abaci, published in 1220, Leonardo di Pisa, known 
also as Fibonacci, posed a counting problem about the growth of a rabbit 
population (Exercise 7 in Section 7.1). The number a, of rabbits after n months 
was shown to satisfy the Fibonacci relation a, = a,-; + d,-». As mentioned in 
Section 6.6, DeMoivre gave the first solution of this relation 500 years later 
using the generation function derivation given in Example 2 of Section 7.5. 
The Fibonacci relation and numbers have proven to be amazingly ubiquitous. 
For example, it has been shown that the ratios of Fibonacci numbers provide 
an optimal way (in a certain sense) to divide up an interval when search- 
ing for the minimum of a function in this interval (see Kiefer [3]). The appear- 
ance of Fibonacci numbers in rings of leaves around flowers is discussed in 
Adler [1]. 

The methods for solving recurrence relations appeared originally in the 
development of the theory of difference equations, cousins of differential equa- 
tions. For a good presentation of the methods and applications of difference 
equations, see Sandefur [4]. 


1. I. Adler, ‘‘The consequence of constant pressure in phyllotaxis,’ J. Theoret- 
ical Biology 65 (1977), 29-77. 


2. T. Cormen, C. Leiserson, and R. Rivest, An Introduction to Algorithms, 
McGraw-Hill, New York, 1989. 


3. J. Kiefer, ‘‘Sequential minimax search for a maximum,”’ Proceedings of 
American Math. Society 4 (1953), 502-506. 


4. J. Sandefur, Discrete Dynamical Systems, Oxford University Press, New 
York, 1985. 


CHAPTER 8 
INCLUSION-—EXCLUSION 


8.1 COUNTING WITH VENN DIAGRAMS 


In this chapter we develop a set-theoretic formula for counting problems involv- 
ing several interacting properties in which either all properties must hold or 
none must hold. An example is counting all five-card hands with at least one 
card in each suit, or equivalently, all five-card hands with no void in any suit. 
In the process of solving such problems, we have to count the subsets of 
outcomes in which various combinations of the properties hold. We use Venn 
diagrams to depict these different combinations. See Appendix A.1 for a review 
of the essentials of sets and Venn diagrams. 

Let us begin with a one-property Venn diagram, and then progress to two- 
and three-property problems. In Figure 8.1 we show a set A within a universe 
U. The complementary set A consists of all elements of U not in A. Let N (S) 
denote the number of elements in set S. We define N = N(U). Then N(A) = 
N — N(A), or N(A) = N - N(A). This is similar to the situation in probability 
where the probability of an event E is one minus the probability of the comple- 
mentary event EF. 

Suppose, for example, that there is a ‘‘universe’’ of 100 students in a math 
course and there are 30 students who are not math majors in the course, that 
is, N(A) = 30, where A is the set of math majors. Then the number N(A) of 
math majors in the course is N(A) = N — N(A) = 100 — 30 = 70. 

Next consider a problem with two sets. Let the universe U be all students 
in a school, let F be the set of students taking French, and let L be the set of 
students taking Latin. See Figure 8.2. We want formulas for the number of 
students taking French or Latin N(F U L) and the number taking neither 
language N(F 1M L) in terms of N, N(F), N(L), and N(F O L). Note that 


Figure 8.1 
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cD 


N(F U L) is not simply N(F) + N(L), because N(F ) + N(L) counts each 
student taking both languages two times. Thus, we must know how many 
students take both languages, the number N(F M L). Subtracting N (F OL) from 
N(F) + N(L) corrects the double counting of students taking two languages. That 
is, 


Figure 8.2 


N(F UL) = N(F) + ML) — NF OL) (1) 
By de Morgan’s Law (Eq. BA3 of Appendix A.1), FOL=FUL, and so 
N(FOL)=N(FUL)=N-NFUL) 
Combining this equation with Eq. (1), we have 
NOL) =N-N(FUL)=N-NF)- NL) + NF OL) (2) 


Example 1 


If a school has 100 students with 50 students taking French, 40 students taking 
Latin, and 20 students taking both languages, how many students take no 
language? 

In this problem, N = 100, N(F) = 50, N(L) = 40, and N(F  L) = 20. 
We need to determine N(F  L). By Eq. (2), we have 


N(F NL) =N— NF) — NL) + NWF NL) = 100 — 50 — 40 + 20 = 30 


Example 2 


How many arrangements of the digits 0, 1, 2,..., 9 are there in which the 
first digit is greater than 1 and the last digit is less than 8? 

Before using formula (2) to solve this problem, let us consider a more 
direct approach and see why it fails. For the first digit, there are 8 choices. 
Then for the last digit, there are . . . the number of choices depends on whether 
or not an 8 or 9 was chosen for the first digit. If an 8 or 9 were chosen for the 
first digit, there will be 8 choices for the last digit, while if neither 8 nor 9 
were chosen for the first digit, there will be 7 choices for the last digit. Because 
of this difficulty, we shall solve the problem with formula (2). 

Let U be all arrangements of 0, 1, 2, ..., 9. Let F be the set of all 
arrangements with a 0 or 1 in the first digit, and let L be the set of all arrangements 
with an 8 or 9 in the last digit. Then the number of arrangements with first 
digit greater than 1 and the last digit less than 8 is N(F 2 L). 
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We have N = 10!, N(F) = 2 X 9! (two choices for the first digit followed 
by any arrangement for the remaining 9 digits), and N(L) = 2 x 9! Similarly, 
N(F OL) = 2 X 2 X 8!. Then by Eq. (2), 


NF OL) = 10! — (2X 9!) - (2X 91) + (2K 2X8!) 0 


Figure 8.3 


Consider next a problem with three sets. We extend Figure 8.2 with the 
additional set G of students taking German, as shown in Figure 8.3. We want 
a formula for N(F 1 LO G). A first guess might be 


N(FOALAG)=N-—N(F) — NUL) — NG) 


As in Figure 8.2, this formula double counts (that is, subtracts twice) the 
students in two of the sets, F, L, and G. We can correct this first formula by 
adding the number of students taking two languages. Thus we propose the 
formula 


N(F ALN G)=N — N(F) — N(L) — NG) 
+N(FOL)+NLONG)+N(FNG) (3) 


Figure 8.4 shows how many times a student will be added and subtracted 
in parts of this formula. A student taking no language is counted once (by the 
term N)—such students are exactly the ones we want to count. The challenge 
is to make sure that all other students are counted a net of 0 times. The students 
taking one language are counted once by N and subtracted once by the term 


Se 
Number of. 


Languages + [N(F NL) 
Taken by — (NF) + NL) +NLNG) 
Student N + N(G)] +NFOG)] -NFOLNG) 
0 +1 0 0 0 
1 +1 —1 0) 0 
2 +1 —2 +1 0 
3 +1 —3 +3 —-1 


Figure 8.4 
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—[N(F) + N(L) + NM(G)], for a net count of 0. The students taking two 
languages are counted once by N, subtracted twice by —[N(F) + N(L) + N(G)] 
(since they are in exactly two of the three sets), and then added once by the 
term + [V(F NL) + N(LN G) + N(F A G)] (since they are in exactly one 
of the three pairwise intersections), for a net count of 0. Finally, we consider 
the students taking all three languages. They are counted once by N, then 
subtracted three times by the sum of the three sets (since they are in all three 
sets), then added three times by the pairwise intersections (since they are in 
all three of these subsets). This yields a net count of 1 — 3 + 3 = 1. Then 
we must correct formula (3) by subtracting N(F M LM G) to make the net 
count of students with all three languages 0: 


NF OLNG)=N-— [N(F) + NIL) + N(G)] 
+ [N(FOAL)+NLOAG)+NFONG)-NMFALNG) (4 


For general sets A,, A), A3, we rewrite Eq. (4) as 


N(Ai NA, NA3) = N — >) N(A) + NYA; A;) — NAL. A. As) (5) 
i uj 


where the sums are understood to run over all possible 7 and all i,j pairs, 
respectively. 


Example 3 


If a school has 100 students with 40 taking French, 40 taking Latin, and 40 
taking German, 20 students are taking any given pair of languages, and 10 
students are taking all three languages, then how many students are taking no 
language? 

Here N = 100, N(F) = N(L) = N(G) = 40, N(F OL) = NLNG) = 
N(F M G) = 20, and N(F 0 L 1 G) = 10. Then by Eq. (4), the number of 
students taking no language is N(F M L MN G) = 100 — (40 + 40 + 40) + 
(20 + 20 + 20) — 10 = 30. ss 


Example 4 


How many positive integers = 70 are relatively prime to 70? 

Let U be the set of integers between 1 and 70. The phrase ‘‘relatively 
prime to 70’’ means ‘‘have no common divisors with 70.’’ The prime divisors 
of 70 are 2, 5, and 7. Then we want to count the number of integers = 70 that 
do not have 2 or 5 or 7 as divisors. Let A, be the set of integers in °U that are 
divisible by 2 (and equivalently, integers in U that are multiples of 2), A2 
integers divisible by 5, and A; integers divisible by 7. Then the number of 
positive integers = 70 that are relatively prime to 70 equals N(A, N A, O 
A;). We find 


N= 70 N(A,) = 70/2 = 35 N(A2) = 70/5 = 14 N(A3) = 70/7 = 10 
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The integers divisible by 2 and 5 are simply the integers divisible by 10. 
Thus N(A, M A.) = 70/10 = 7. By similar reasoning, N(A, M A3) = 70/(5 x 
7) = 2, N(A, N A3) = 70/2 X 7) = 5, and N(A, M A) MA3) = 70/2 x 5 xX 
7) = 1. So by Eg. (5): 


N(A, NA, As) = 70 — (35 + 144+ 10) +(7+24+5)-1=24 . 


Example 5 


How many n-digit ternary (0, 1, 2) sequences are there with at least one 0, at 
least one 1, and at least one 2? ; 

Let “U be the set of all n-digit ternary sequences. Formula (5) counts 
outcomes in which none of a set of properties holds. So we must formulate 
this problem in terms of outcomes for which none of a set of properties holds. 
The solution is to define A; to be the number of n-digit ternary sequences with 
no is. (Note that instead of numbering the sets A,, A,, A3, we are using Ao, Aj, 
A;.) Then the number of sequences with at least one of each digit will be 
N(Ap 1 A; N A). 

The number of n-digit ternary sequences is N = 3". The number of n-digit 
ternary sequences with no Os is simply the number of n-digit sequences of 1s 
and 2s. Thus, N(A) = 2". Similarly, N(A,) = N(A,) = 2”. The only n-digit 
sequence with no Os and no Is is the sequence of all 2s. Then N (Ap N. A,) = 
1; also N(A; M Az) = N(Ay N A) = 1. Finally, there is no ternary sequence 
with no Os and no 1s and no 2s. Then by Eq. (5): 


N(Ay NA, NA) = 3" — (2" + 2" + 24+ (1 +14+1)-0 
=3"-3x2"+3 8 


Observe that whereas polynomial algebra was used in Chapter 6 to model 
counting problems and recurrence relations were used in Chapter 7, now we 
are using a set-theoretic model. This approach does not eliminate combinatorial 
enumeration as the other models did. We still must solve the subproblems of 
finding N(A;), N(A; N A,), and so forth, but these are much easier problems. 


Example 6 


Suppose there are 100 students in a school and there are 40 students taking 
each language, French, Latin, and German. Twenty students are taking only 
French, 20 only Latin, and 15 only German. In addition, 10 students are taking 
French and Latin. How many students are taking all three languages? No 
language? 

We draw the Venn diagram for this problem and number each region as 
shown in Figure 8.5. Let N; denote the number of students in region i, for i = 
1,2,..., 8. Students taking only French are the subset F 1 LO G, region 1; 
so N, = 20. Similarly, the other information given us says Ns = 20 and N, = 
15. Students taking both French and Latin are the subset F Q L = (FOALN 
G) U(FNLNG), regions 2 and 3. So N, + N; = 10. The set of students 
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Aon 
Figure 8.5 | ee, 


taking French is F, which consists of regions 1, 2, 3, and 4. So 
40 = N(F) = N, + (N. + Ns) + Ng = 20 + (10) + Ns, or N, = 10 


Similarly, set L consists of regions 2, 3, 5, and 6, and so 40 = N(L) = (N, + 
N3) + Ns + Ns = (10) + 20 + Ng, implying Ne = 10. 

Since G consists of regions 3, 4, 6, and 7, and since we were given N, = 
15 and have just found that N, = N, = 10, then 


40 = N(G) = N; +N, + No +N, =N; + 104+ 10 + 15 or N;=5 


But region 3 is the subset F M L /M G of students taking all three languages. 
Thus, there are five trilingual students. 

The general line of attack in these problems is to break the Venn diagram 
into the eight regions shown in Figure 8.5. For each subset whose size is given, 
write that number as a sum of N;,’s of regions i in that subset. By combining these 
equations (and sometimes solving them simultaneously), one can eventually 
determine the number of elements in each region. Then the size of any subset 
is readily found as the sum of the sizes of the regions in that subset. For 
example, in the preceding paragraph we determined all N;s except N, and Ng. 
But N, + N; = 10 and N; was found to be 5; thus N, = 5. Ng = NFOLOG) 
is the number of students taking no language. Since all regions total to N, 
then 


7 
N,=N—->N,=100-(20+5+5+ 10+ 20+ 10+ 15) 
i=] 


=100-85=15 = 
Note that because there are 8 underlying variables (the sizes of the 8 basic 
regions), we need to be given 8 pieces of information to be able to solve such 
a problem. 


8.1 EXERCISES 


Mo Summary of Exercises The first 24 exercises are similar to 


the examples in this section. The last five exercises involve more complicated 


8.1 Counting With Venn Diagrams 323 


Venn diagram arguments; see the last paragraph in Example 6 for the strategy 
for solving these problems. 


1. 


2. 
3. 


12. 


13. 


14. 


How many 5-letter ‘‘words’’ using the 26-letter alphabet (letters can be 
repeated) either begin or end with a vowel? 


How many 9-digit Social Security numbers are there with repeated digits? 


How many n-digit ternary sequences are there in which at least one pair 
of consecutive digits are the same? 


- What is the probability that at least two heads (not necessarily consecutive) 


will appear when a coin is flipped 8 times? 


- What is the probability that a 5-card hand has at least one pair (possibly 


two pairs, three of a kind, full house, or four of a kind)? 


. Ifn people of different heights are lined up in a queue, what is the probability 


that at least one person is just behind a taller person? 


- How many 4-digit numbers (including leading Os) are there with exactly 


one 8 and no digit appearing exactly two times? 


- Suppose 60 percent of all families own a dishwasher, 30 percent own a 


trash compacter, and 20 percent own both. What percentage of all families 
own neither of these two appliances? 

Among 600 families, 100 families have no children, 200 have only boys, 
and 200 have only girls. How many families have boy(s) and girl(s)? 


. How many arrangements of the digits 0, 1, ..., 9 are there that do not 


end with an 8 and do not begin with a 3? 


. Suppose a bookcase has 200 books, 70 in French and 100 about mathemat- 


ics. How many non-French books not about mathematics are there if: 
(a) There are 30 French mathematics books? 
(b) There are 60 French nonmathematics books? 


How many arrangements of the 26 different letters are there that: 

(a) Contain either the sequence ‘‘the’’ or the sequence ‘‘aid’’? 

(b) Contain neither the sequence ‘‘the’’ nor the sequence ‘‘math’’? 

A school has 200 students with 80 students taking each of the three subjects: 


trigonometry, probability, and basket-weaving. There are 30 students taking 
any given pair of these subjects, and 15 students taking all three subjects. 


(a) How many students are taking none of these three subjects? 
(b) How many students are taking only probability? 
Suppose 60 percent of all college professors like tennis, 65 percent like 


bridge, and 50 percent like chess; 45 percent like any given pair of recre- 
ations. 


(a) Should you be suspicious if told 20 percent like all three recreations? 
(b) What is the smallest percentage who could like all three recreations? 
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15. How many numbers between | and 30 are relatively prime to 30? 
16. How many numbers between 1 and 280 are relatively prime to 280? 


17. How many ways are there to assign 20 different people to three different 
rooms with at least one person in each room? 


18. How many n-digit numbers are there with at least one of the digits 1 or 2 
or 3 absent? [Hint: This is a union problem; find N(A; U A) U A3).] 


19. How many arrangements are there of MURMUR with no pair of consecutive 
letters the same? 


20. If three couples are seated around a circular table, what is the probability 
that no wife and husband are beside one another? 


21,, How many arrangements are there of TAMELY with either T before A, 

a or A before M, or M before E? By ‘‘before,’’ we mean anywhere before, 
not just immediately before. [Hint: This is a union problem; find N(A, U 
A, U A3).] 

22. How many arrangements are there of MATHEMATICS with both Ts before 
both As, or both As before both Ms, or both Ms before the E? Note that 
‘‘before’’ is used as in Example 21. 


The Bernsteins, Hendersons, and Smiths each have 5 children. If the 15 
children of these three families camp out in five different tents, where each. 
tent holds 3 children, and the 15 children are randomly assigned to the five 
tents, what is the probability that every family has at least two of its children 
in the same tent? 


23 


. 


24. Suppose 45 percent of all newspaper readers like wine, 60 percent like 
orange juice, and 55 percent like tea. Suppose 35 percent like any given 
pair of these beverages and 25 percent like all three beverages. 


(a) What percentage of the readers likes only wine? 


(b) What percentage of the readers likes exactly two of these three bever- 
ages? 


25. Suppose that among 40 toy robots, 28 have a broken wheel or are rusted 
but not both, 6 are not defective, and the number with a broken wheel 
equals the number with rust. How many robots are rusted? 


26. Suppose a school with n students offers two languages, PASCAL and 
BASIC. If 30 students take no language, 70 students do not take just 
PASCAL (ie., either they do not take PASCAL or they take both lan- 
guages), 80 students do not take just BASIC, and 20 students take both 
languages, determine n. 


27. Suppose a school with 120 students offers Yoga and Karate. If the number 
of students taking Yoga alone is twice the number taking Karate (possibly, 
Karate and Yoga), if 25 more students study neither skill than study both 
skills, and if 75 students take at least one skill, then how many students 
study Yoga? 
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28. Ina class of 30 children, 20 take Latin, 14 take Greek, and 10 take Hebrew. 
If no child takes all three languages and 8 children take no language, how 
many children take Greek and Hebrew? [Hint: Use formula (5) to determine 
the value of the expression N(L M G) + (LN H) + N(GN #f).) 


29. Suppose among 150 people on a picnic, 90 bring salads or sandwiches, 80 
bring sandwiches or cheese, 100 bring salads or cheese, 50 bring cheese 
and either salad or sandwiches (possibly both), 60 bring at least two foods, 
and 20 bring all three foods. How many people bring just one of the three 
foods? How many people bring sandwiches? 


—v 


8.2. INCLUSION-EXCLUSION FORMULA 


In this section we generalize the formula for counting N(A; MA, A;) to n 
sets A, A,,* ++, A,. To simplify notation, we will omit the intersection symbol 
“1” in expressions and write intersected sets as a product. For example, A; 
A; M A; would be written A,A,A;3. Using this new notation, the number of 
elements in none of the sets A,, A>, -- +, A, will be written N(A\A2° + +.A,). 
The following formula is known as the Inclusion—Exclusion Formula because 
of the way it successively includes (adds) and excludes (subtracts) the various 
k-tuple intersections of sets. 


Theorem 1 Inclusion—Exclusion Formula 


Let A\, A), +++ A,, ben sets in a universe U of N elements. Let S, denote the 
sum of the sizes of all k-tuple intersections of the A,s. Then 


N(A\A,* ++ A,) =N —S,+S.—-S3+++++(-1)', + +++ +(-D’S, (1) 


Proof 


To clarify the definition of the S,s, S; = 2,N(A;), S. = 2;;N(A;A;), S; is the 
sum of the N(A;,A;, °° -A,,)s for all sets of k A;s, and finally Sn = N(A, Az 
"++ A,). We prove this formula by the same method used for N(F L G) in the 
previous section: we shall show that the net effect of Eq. (1) is to count any 
element in none of the sets A; once and to count elements in one or more A ;8 
a net of 0 times. 

If an element is in none of the A;s, that is, is in AjA,---A,, then it is 
counted once in the right-hand side of Eq. (1) by the term N and is not counted 
in any of the S,s. So the count is 1 for each element in A \A.> ++ A,, as required. 
An element in exactly one A; is counted once by N, is subtracted once by S, 
(since it is in one of the A jS), and is counted in none of the other S,s—for a 
count of 1 —1 = 0, as required. Now more generally let us show that an 
element x that is in exactly m of the A;s has a net count of 0 in Eq. (1). Element 
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x is counted once by N, is counted m times by S, (since x is in m A;s), is 
counted C(m, 2) times by S, [since x is in the intersection A;A ; for the C(m, 
2) pairwise intersections involving two of the m sets containing x], ..., and, 
in general, is counted C(m, k) times by S,, k = m. It is not counted in §, when 
h > m. So the net count of x in Eq. (1) is 


GG) G)eteoG)+ ent) @ 


This alternating sum of binomial coefficients can be evaluated from the 
binomial expansion 


(+y"=14 (‘)x+ (se Sse (\3 
feeeg (™) x= (3) 


If we set x = —1 in Eq. (3), the right side of Eq. (3) is now the expression in 
Eq. (2), whereas the left side of Eq. (3) becomes [1 + (—1)]” = 0” = 0. Thus, 
Eq. (2) equals 0, as required. @ 

Corollary 

Let A,, A, - ++ A, be sets in the universe U. Then 


N(A, UA, U ens UA,) =S,—-$,+8, 
(DMS HH ECD, @) 


Proof 
We write formula (1) as 


N(AjA2° ++ A,) = N — [S, — S.+S3— +++ +(-1)"'S,] (5) 


Next we observe that the number of elements in none of the sets equals the 
total number of elements minus the number of elements in one or more sets, 
that is, 


N(A\A,° ++ A,) =N—N(A, UA) U+ ++ UA,) (6) 


Comparing Eqs. (5) and (6), we see that the expression in brackets on the right 
side of Eq. (5) is N(A; U A, U:-- UA,). @ 


Before giving examples of the inclusion—exclusion formula, we want to 
emphasize an important logical point about applying this formula. To use this 
formula in a counting problem, one must select a universe U and a collection 
of sets A; in that universe such that the outcomes to be counted are the subset 
of elements in °U that are in none of the A;s. That is, the A,s represent properties 
not satisfied by the outcomes being counted. 


8.2  Inclusion—Exclusion Formula 327 


Example 1 


How many ways are there to select a 5-card hand from a regular 52-card deck 
such that the hand contains at least one card in each suit? 

The universe UU should be the set of all 5-card hands. We need to define 
the sets A; such that hands with at least one card in each suit are in none of 
the A,;s. With a moment’s thought, we see that at least one card in a suit is 
equivalent to no void in the suit. Thus, we let A, be the set of 5-card hands 
with a void in spades; A, a void in hearts; A; a void in diamonds; and A, a 
void in clubs. Now the question asks for N(A,|AA;A,), and we can use the 
inclusion—exclusion formula. 

We must next calculate N, $,, S2, S;, and S,. As noted in Chapter 5, a 5- 
card hand is simply a subset of 5 cards, and so N = C(52, 5). The size of Aj, 
the set of hands with a void in spades, is simply the number of 5-card hands 
chosen from the 52 — 13 = 39 nonspade cards. So N(A;) = C(39, 5). Likewise, 
N(A;) = C(39, 5), i = 2, 3, 4, and so S$; = 4 X C9, 5). The hands in A,A, 
are hands chosen from the 26 non-(spades or hearts) cards, and so N(A,A>) = 
C(26, 5). There are C(4, 2) = 6 different intersections of two out of the four 
sets, and so S$, = 6 X C(26, 5). There are C(4, 3) = 4 different triple intersections 
of the sets and each has C(13, 5) hands. So §; = 4 X C(13, 5). Finally, a hand 
cannot be void in all four suits, and so S, = 0. Then by Eq. (1), 


ie ie ie 52 39 26 13 
= - + ~ + 
We note that, in general, S, is a sum of C(n, k) different k-tuple intersections 
of the n A;s. 


Example 2 


How many ways are there to distribute r distinct objects into five (distinct) 
boxes with at least one empty box? 

Let U be all distributions of r distinct objects into five boxes. Let A; be 
the set of distributions with a void in box i. Then the required number of 
distributions with at least one void is N(A, U A, U-++ As). We have N = 5’, 
N(A;) = 4° (distributions with each object going into one of the other 4 boxes), 
N(A,A;) = 3’, and so forth. As just noted, there are C(5, k) subsets in S,, k = 
1,2,..., 5. Thus by Eq. (4), 


N(A, UA, U +++ As) = 8, —S,+8;—-S,4+ Ss 


“()e-QeQ)e-(ree. 


It is easy to mistake union problems, which use phrases such as ‘‘with at 
least one empty box,’’ with standard inclusion—exclusion problems, which use 
phrases such as ‘‘at least one object in every box.’’ The former ask for at least 
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one of a set of properties to hold, whereas the latter ask for every property to 
hold. Moreover, the latter problems must be solved by using the complementary 
properties, such as ‘‘box i is empty,’’ and determining all ways for none of 
these complementary properties to hold. The reader has to reason carefully 
through a counting problem and determine whether to frame the solution as 
counting outcomes where none of a set of properties hold or outcomes where 
one or more properties hold. 

Another common source of confusion is the subscripts of the S;s and the 
subscripts of the A;s. In example 1, students sometimes define A, to be all 
hands with a void in one suit, A, to be all hands with a void in two suits, and 
so on. This is wrong. The subscript of the As is an ordinal (ordering) number. 
A, denotes the first set (in Example 1, the set of all hands with a void in spades), 
A, the second set, A; the third set, .... The subscript of the Ss is a cardinal 
(magnitude) number. S$, is the sum of the sizes of all single sets, S, the sum 
of the sizes of all pairwise intersections of sets, $; the sum of the sizes of all 
3-way intersections of sets, .... 


Example 3 
How many different integer solutions are there to the equation 
X; +X. + x3 + x4 + X5 + X65 = 20 0=x,=8 

This type of problem was solved with generating functions in Section 6.2. 
Now we solve it with an inclusion—exclusion approach. Let U be all integer 
solutions with x; = 0, and let A; be the set of integer solutions with x; > 8, or 
equivalently x; = 9. Then the number of solutions with 0 = x; = 8 will be 
N(A\A2 Se ate Ag). 

Recalling the formulas for integer solutions of equations from Section 5.5 


(Example 5) we have 
_ {20+6-1\_ (25 
v=( 20 Jeta) 


To count N(A,), the outcomes with x, = 9, we consider the x;s to be 
amounts of objects chosen when a total of 20 objects are chosen from 6 types. 
The outcomes with at least 9 of the first type can be generated by first picking 
9 of the first type and then picking the remaining 20 — 9 objects without 
restriction from the 6 types. This reasoning applies to all N(A)): 


_ (20-9) +6-1)_ (16 
niay=( (20 — 9) ) a 


By a similar reasoning, now first choosing 9 from two types and then choosing 
the remaining 20 — 9 — 9 objects without restriction, we have 


[OTe VTSa 1 af 
ncaa) = ( (20-9 — 9) (2) 
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For a solution to be in three or more A;s, the sum of the respective x;s would 
exceed 20—impossible. So S; = 0 for j = 3, and 


— BN othe _ (25\ _ (6\/16 6\{7 
N(A,A2+ ++ A.) =N s.+5:= (39) (8) + (8)(2) « 


Recall that to use generating functions to solve the preceding problem, we 
would seek the coefficient of x”° in 


(+txte+-4+ x9 = [11 — x /0 — do = (1 — x1 — 08 


The coefficient or x in this product is Agby + agby, + aigb where a; is the 
coefficient of x‘ in (1 — x)~6 and b;, is the coefficient of x‘ in (1 — x°)®. This 
coefficient of x” turns out to be exactly the foregoing expression for 
N(A,A2°° As). The (1 — x°)’ factor does the inclusion—exclusion task of 
SUbCaSHAE cases where one x; is at least 9 and adding back cases where two 
x; are at least 9. By using generating functions to solve this problem, we did 
not need to know anything about the inclusion—exclusion complexities of this 
problem. Generating functions automatically performed the required inclu- 
sion—exclusion calculations! 


Example 4 


What is the probability that if n people randomly reach into a dark closet to 
retrieve their hats, no person will pick his own hat? 

The probability will be the fraction of outcomes in which no person gets 
his own hat. Our universe °U is all ways for the n people to successively select 
a different hat. So N = n! Let A; be the set of outcomes in which person i gets 
his own hat. Then N(A;) = (n — 1)!, since given that person i gets his hat, the 
number of possible outcomes is all ways for the other n — 1 coop to select 
hats. Similarly N(A;A;) = (n — 2)!, and generally N(A; Aj,° °° A; J=an- 
k)! for k-way intersections. Since S, is a sum of C(n, k) kway terms, we obtain 
by Eq. (1) 


N(A\Az* ++ A,) =N —S, +S) 4+ +++ +(-1)'S, ++ +> +(-)'S, 


=n! — @G —1)f+ (3) —Ql+--- 
+ coo —Alte + -y(”) 0! 


=S(- »'( en ~ bk! 
i=0 \k 
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: n\ _ nt n _ al 
Recalling that . = Hab! we see that @G —ki= ee So 


and now the probability that no person gets his own hat is 


N(A,A2: * -A,) e cut / 1 1 (—1)" 
ee ee eS ! pees 2S = = —— ype ee +A 
N nt > oy ee ee aS 


This alternating series is the first n + 1 terms of >, (—1)*/k!, which is the 
k=0 


power series for e* when x = —1. The series converges very fast. The difference 
between e~' and Eq. (7) is always less than 1/n! For example, e “| = 0.367879 
_.. and for n = 8, the series in Eq. (7) equals 0.367888 (even for n = 5, it is 
0.366). Thus, for all but very small n, the desired probability is essentially e |. 
The answer is virtually independent of n. = 


The problem treated in Example 4 is equivalent to asking for all permuta- 
tions of the sequence 1, 2,..., such that no number is left fixed, that is, no 
number / is still in the ith position. Such rearrangements of a sequence are called 
derangements. The symbol D,, is used to denote the number of derangements of 
n integers. From Example 4, we have 


=nie! 


n ¢_4y\k 
D, =n! > ae 
fo =k! 


For exact values of D,,, it is usually easier to use the following simple recurrence 
relation: 


D, = nD,-; + (-1)" n=2 


See Exercise 34 for details on deriving this recurrence. 


Example 5 


How many ways are there to color the four vertices in the graph shown in 
Figure 8.6 with n colors (such that vertices with a common edge must be 
different colors)? 

We label the edges 1, €2, €3, €4, @s, aS Shown in Figure 8.6. The universe 
should be the set of all ways to color the vertices. So N = n‘ (n choices for 
each vertex). The situation we must avoid is having adjacent vertices the same 
color. For each edge e;, we define the set A; to be all colorings of the graph 
in which the vertices at either end of edge e; have the same color. Then 
N(A;) = n} (since the two vertices connected by e; get one color and the two 
other vertices each get any color), and S$; = 5 X n>, Similarly, N(A;A;) = 7” 
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cal 


&2 &5 


Figure 8.6 3 


and S$, = C(5, 2) X n’. A little care must be used with three-way intersections. 
Specifically, edges e), @, e; interconnect three, not four, vertices, and so 
N(A,A,A;3) = n? (one color for vertices x), x., x; and one color for vertex x,). 
Likewise, N(A3A4As) = n’. Any other set of three edges interconnect all four 
vertices, and so the associated N(A;A;A;) equals n. Then $; = 2 X n? + [C(5, 
3) — 2] X a. Four or five edges interconnect all four vertices, and so S, = 
C(5, 4) X n and S; = n. Then the answer to our problem is 


NG AAA AD = nt —($)n+ (3) 
5 5 
- 2 —2)xXn|+ - 
PG) le) 

=n*— 5n>+ 8n?—4n 2 

Note that the preceding expression is the chromatic polynomial of this graph 
(see Example 7 in Section 5.2). 

We conclude this section with two generalizations of the inclusion—exclu- 


sion formula. Many readers may want to skip this material. 


Theorem 2 

If A,, Ax, ..., A, are n sets in a universe U of N elements, then the number 
N, of elements in exactly m sets and the number N% of elements in at least m 
sets are given by: 


+ 
Nn = Sn — % . Ret + é ae feed cayre(*)s 
m m m 


feet ce(")s, (8) 


pieiee m m+1 or take eed 
Ni Sm (2 ae S42 + + ( 1) m—-1 Sx 


freed cre(™7 is (9) 
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Proof 


The formula for N,, can be proved in a fashion similar to the proof of the 
inclusion—exclusion formula. We must show that elements in more than m sets 
are counted a net of O times in Eq. (8). In this formula the count is slightly 
trickier to sum. Readers who dislike such technicalities should skip this proof. 

The count for an element in r sets, m = r = n, is C(r, &) in S,, and so the 
net count of this element in Eq. (8) is 


CeCe) 
oucetea( )t) a 


Remember that the element is not counted in S, for k > r. Recall from Example 
1 of Section 5.5 that the number of ways to pick k objects from r and then 
pick m special objects from those k is equal to the number of ways to pick the 
m special objects from the r first and then pick k — m more from the remaining 


yy -(geeg 


Using this substitution, Eq. (10) now becomes 


()-(Q)ar) ++ eon) 
pe renee(\(r=™ 


m r—m 


6S red es or ee ae 


As in Theorem 1, the expression in brackets here is just the expansion of 
(1 + x)" with x = —1. So Eq. (10) sums to 0, as required. 

The formula for V* can be verified with induction by showing that formula 
(9) for NX satisfies N* = N,, + N*,,: ‘‘In at least m sets’’ means ‘‘in exactly 
m sets’’ or “‘in at least m + 1 sets.”? @ 


Example 6 


Find the number of 4-digit ternary sequences with exactly two Is. Also find 
the number with at least two 1s. 

Let U be all 4-digit ternary sequences. Let A; be 4-digit ternary sequences 
with a 1 in position 7. Then N = 34 and S, = C(4, 1)3°, S, = C(4, 2)3’, $3 = 
C(4, 3)3', and S, = C(4, 4)3° = 1. Now by formulas (8) and (9), the desired 
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numbers are 


nos-()s+(}e-(>-Qp]-h-a 
wres-(}se(}eelE>-CLOp] Cpa 


The observant reader should recall that N, can be computed directly by a simple 
combinatorial argument. 


8.2. EXERCISES 


Mb Summary of Exercises These exercises require a combination 


of inclusion—exclusion modeling and Chapter 5 type of enumeration skills (to 
solve the subproblems, some of which are tricky). Exercises 35—45 use Theorem 


1. How many n-digit decimal sequences (using digits 0, 1, 2,..., 9) are there 
in which digits 1, 2, 3 all appear? 
. How many ways are there to roll 10 distinct dice so that all 6 faces appear? 
. What is the probability that a 10-card hand has at least one 4 of a kind? 
. How many positive integers = 420 are relatively prime to 420? 
. What is the probability that a 13-card bridge hand has: 
(a) At least one card in each suit? 


ma b&b HW N 


(b) At least one void in a suit? 
(c) At least one of each type of face card (face cards are Aces, Kings, 
Queens, and Jacks)? 
6. Given five pairs of gloves, how many ways are there for five people to 
each choose two gloves with no one getting a matching pair? 


7. How many arrangements are there of a, a, a, b, b, b, c, c, c, without three 
consecutive letters the same? 


8. Given 2n letters, two of each of n types, how many arrangements are there 
with no pair of consecutive letters the same? 


9. How many permutations of the 26 letters are there that contain none of 
the sequences MATH, RUNS, FROM, or JOE? 


10. How many integer solutions of x, + x. + x; + x, = 30 are there with: 
(a) OS x; = 10? 
(b) —10 = x; = 20? 
(c) O = x, x, = 5, x, = 10, x; = 15, x, S 21? 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19, 


20. 


21. 


How many ways are there to distribute 25 identical balls into six distinct 
boxes with at most 6 balls in any of the first three boxes? 


How many 4-digit numbers (including leading Os) are there with no digit 
appearing exactly two times? 


How many ways are there for a child to take 12 pieces of candy with four 
types of candy if the child does not take exactly two pieces of any type 
of candy? 


Santa Claus has five toy airplanes of each of n plane models. How many 
ways are there to put one airplane in each of r (r = n) identical stockings 
such that all models of planes are used? 


A wizard has five friends. During a long wizard’s conference, it met any 
given friend at dinner 10 times, any given pair of friends 5 times, any 
given threesome of friends 3 times, any given foursome 2 times, and all 
five friends together once. If in addition it ate alone 6 times, determine 
how many days the wizard’s conference lasted. 


How many secret codes can be made by assigning each letter of the alphabet 
a (unique) different letter? 


How many ways are there to distribute 10 books to 10 children (one to a 
child) and then collect the books and redistribute them with each child 
getting a new one? 


How many arrangements of 1, 2, ..., ” are there in which only the odd 
integers must be deranged (even integers may be in their own positions)? 


How many ways are there to assign each of five professors in a math 
department to two courses in the fall semester (i.e., 10 different math 
courses in all) and then assign each professor two courses in the spring 
semester such that no professor teaches the same two courses both semes- 
ters? 


Consider the following game with a pile of n cards numbered 1 through 
n. Successively pick a different (random) number between 1 and n and 
remove all cards in the pile down to, and including, the card with this 
number until the pile is empty. If the chosen card number has already been 
removed, pick another number. What is the approximate probability that 
at some stage the number chosen is the card at the top of the pile? (Hint: 
Approach as an arrangement with repetition.) 

The rooms in the circular house plan shown below are to be painted using 


eight colors such that rooms with a common doorway must be different 
colors. In how many ways can this be done? 


8.2 Inclusion—Exclusion Formula 335 


22. The four walls and ceiling of a room are to be painted with five colors 
available. How many ways can this be done if bordering sides of the room 
must have different colors? 


23. How many ways are there to color the vertices with n colors in the following 
graphs such that adjacent vertices get different colors? 


“OE "ag 


24, (a) How many arrangements of the integers 1 through v are there in which 
i is never immediately followed by i + 1, fori = 1,2,...,”— 1? 


(b) Show that your answer equals D, + D,,1. 


25. Repeat Exercise 24(a) now considering n to be followed by 1. 


26. How many ways are there to seat n couples around a circular table such 
that no couple sits next to each other? 


27. If two identical dice are rolled n successive times, how many sequences 
of outcomes contain all doubles (a pair of 1s, of 2s, etc.)? 


28. How many arrangements of a, a, a, b, b, b, c, c, c have no adjacent 
letters the same? (Hint: This is tricky—not a normal inclusion—exclusion 
problem.) 


29. If there are n families with five members each, how many ways are there 
to seat all 5” people at a circular table so that each person sits beside another 
member of his/her family? (Hint: This is not a standard inclusion—exclusion 
problem but uses a type of inclusion—exclusion argument.) 


30. Suppose that a person with seven friends invites a different subset of three 
friends to dinner every night for one week (seven days). How many ways 
can this be done so that all friends are included at least once? 

31. How many ways are there to distribute r distinct objects into n indistinguish- 
able boxes with no box empty? 

32. Find the number of ways to give each of six different people seated in a 
circle one of m different types of entrees if adjacent people must get 
different entrees. 

33. Determine N, in Example 6 by a direct combinatorial argument. 


34. (a) Show that D,, satisfies the recurrence D,, = (n — 1)(D,-, + D,-2). 


(b) Rewriting the recurrence in part (a) as D, — nD,_, = —[D,-, — (n—- 
1)D,-2], iterate backwards to obtain the recurrence D, = nD,_, + 
(-1)". 


(c) Use part (b) to make a list of D, values up to n = 10. 
(d) Use part (a) to show that D(x) = e*(1 — x)! is the exponential 
generating function for D,,. 
a 
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35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


How many ways are there to distribute r distinct objects into n distinct 
boxes with exactly three empty boxes? With at least three empty boxes? 
How many ways are there to deal a six-card hand with at most one void 
in a suit? 

How many ways are there to arrange the letters in INTELLIGENT with 
at least two consecutive pairs of identical letters? 

If n balls labeled 1, 2, ..., n are successively removed from an urn, a 
rencontre is said to occur if the ith ball removed is numbered i. If the n 
balls are removed in random order, what is the probability that exactly k 
rencontres occur? Show that this probability is about e~'/k! 

Show that the number of ways to place r different balls in n different cells 
with m cells having exactly k balls is 


(-1)"aIr!< (n — j)"* 
1)’ ; 
m6 Gomi ple =e! 
(a) If g(x) is the (ordinary) generating function for N,, (see Theorem 2), 


show that g(x) = > S,(x — 1)‘. This g(x) is called the hit polynomial. 
k=0 


(b) Show that 2[g(1) + g(—1)] is the number of elements in an even 
number of Ajs. 


(c) Use part (b) to determine the number of n-digit ternary sequences with 
an even number of Os. Simplify your answer with a binomial expansion 
summation. 


Show that 


= x (<) 


k=m \M 


Use a combinatorial argument (with inclusion—exclusion) to prove: 


(a) 5-4") ~ (ea) mt+r<n 
(b) Sep) =0 
() Sco(") “ ee ') 
@ Scr e to) 


Use Theorem 2 to show that 


()-Zcnr (A) 
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44, Prove formula (9) in Theorem 2. 


45. In Example 4, let the random variable X = number of people who get their 
own hat. Find F(X). 


———+___—_y 


8.3. RESTRICTED POSITIONS AND 
ROOK POLYNOMIALS 


In this section we consider the special problem of counting arrangements of n 
objects when particular objects can appear only in certain positions. We will 
solve one such problem involving five objects in careful detail. In the process 
we will indicate how to generalize our analysis to any restricted-positions 
problem. 

Consider the problem of finding all arrangements of a, b, c, d, e with the 
restrictions indicated in Figure 8.7. That is, a@ may not be put in position 1 or 
5; b may not be put in 2 or 3; c not in 3 or 4; and e not in 5. There is no 
restriction on d. A permissible arrangement can be represented by picking five 
unmarked squares in Figure 8.7, with one square in each row and each column. 

For example, the permissible arrangement badec corresponds to picking squares 
(a, 2), (b, 1), (c, 5), (d, 3), (e, 4). 

When viewed in terms of the 5 X 5 array of squares, the arrangement 
problem can be thought of as a matching problem, matching letters with posi- 
tions. In Section 4.3 we also considered matching problems. There we wanted 
to determine whether any complete matching existed. Here we want to count 
how many complete matchings there are. 

We use the inclusion—exclusion formula, expression (1) of the previous 
section, to count the number of permissible arrangements for Figure 8.7. Let 
Ul be the set of all arrangements of the five letters without restrictions. So 
N = 5}. Let A; be the set of arrangements with a forbidden letter in position i 
(note that we could equally well define the properties in terms of the th letter 


then be N(A, A, A; A; As). In terms of Figure 8.7, A; is the set of all collections 


Positions 


Figure 8.7 


338 Chapter 8 Inclusion—Exclusion 


of five squares, each in a different row and column such that the square in 
column i is a darkened square. We obtain N(A;) by counting the ways to put 
a forbidden letter in position i times the 4! ways to arrange the remaining four 
letters in the other four positions (we do not worry about forbidden positions 
for these letters). Then N(A,) = 1 X 4!, N(A2) = 1 X 4!, M(A3) = 2 X 41, 
N(Aq) = 1 X 4!, and N(As) = 2 X 4!. Collecting terms we obtain 


5 
S,= MA) =1X 414141424141 414+2X4! 
i=l 


=(1+14+2+1+2)41=7X4! 


Observe that (1 + 1 + 2 + 1 + 2) = 7 is just the number of the darkened 
squares in Figure 8.7. Since each choice of a darkened square (i.e., some letter 
in some forbidden position) leads to 4! possibilities, then 


S, = (number of darkened squares) X 4! 


for any restricted-positions problem with a 5 X 5 family of darkened squares 
similar to Figure 8.7. 

Next, N(A,A,) will be the number of ways to put (different) forbidden letters 
in positions i and j times the 3! ways to arrange the remaining three letters. 
Or equivalently, the ways to pick two darkened squares, one in column i and 
one in column j (and in different rows), times 3!. The reader can check that 


N(A\A2) =1X3!  N(A\A3)=2X3! NAA) =1X3! 
N(A\As) =1X3! 9 N(ApA3) = 1X3! N(AAg) = 1 X 3! 
N(AnAs) = 2X3!  N(AxAg) = 1X3! N(A3A5) = 4 X 3! 
N(A,As) = 2 X 3! 


Collecting terms, we obtain 


S.= NGA) = (1+ 2+ 1414141424144 + 2)3! = 16 x3! 
ty 


The number 16 counts the ways to select two darkened squares, each in a 
different row and column. Generalizing, we will have 


number of ways to pick k darkened squares 
St ( each in a different row and column GO —k) (1) 


Since letter d’s row in Figure 8.7 has no darkened squares, there is no 
way to pick five darkened squares, each in a different row (and column). Thus 
Ss = 0. On the other hand, tedious case-by-case counting apparently awaits us 
for S; and S,. Instead, let us try to develop a theory for determining the number 
of ways to pick k darkened squares, each in a different row and column. 

This darkened squares selection problem can be restated in terms of a 
recreational mathematics question about a chess-like game. A chess piece called 
a rook can capture any opponent’s piece in the same row or column of the 
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rook (provided there are no intervening pieces). Instead of using a normal 
8 X 8 chessboard, suppose we ‘‘play chess’’ on the “‘board’’ consisting solely 
of the darkened squares in Figure 8.7. 

Counting the number of ways to place & mutually noncapturing rooks on 
this board of darkened squares is equivalent to our original subproblem of 
counting the number of ways to pick k darkened squares in Figure 8.7, each 
in a different row and column. The phrase “‘k mutually noncapturing rooks’’ 
is simpler to say and more pictorial. 

A common technique in combinatorial analysis is to break a big messy 
problem into smaller manageable subproblems. We will now develop two 
breaking-up operations to help us count noncapturing rooks on a given board 
B. The first operation applies to a board B that can be decomposed into disjoint 
subboards B, and B,, that is, subboards involving different sets of rows and 
columns. Often a board has to be properly rearranged before the disjoint nature 
of the two subboards can be seen. 

When the rows and columns of Figure 8.7 are rearranged as shown in 
Figure 8.8, it is obvious that the three darkened squares in rows a and e and 
columns 1 and 5 are disjoint from the four darkened squares in rows b and c 
and columns 2, 3, and 4. Let B be the board of darkened squares in Figure 8.8, 
let B, be the three darkened squares in rows a and e, and let B, be the four 
darkened squares in rows b and c. 

Define r,(B) to be the number of ways to place k noncapturing rooks on 
board B, r,(B,) the number of ways to place & noncapturing rooks on subboard 
B,, and r,(B,) the number of ways to place & noncapturing rooks on subboard 
B,. There are three ways to place one rook on subboard B, in Figure 8.8, since 
B, has three squares, and similarly four ways to place one rook on subboard 
B,. Then r,(B,) = 3 and r,(B,) = 4. A little thought shows that there is only 
one way to place two rooks on subboard B, and three ways to place two rooks 
on subboard B,, so that r,(B,) = 1 and r,(B,) = 3. Note that r,(B,) = r,(B2) 
= 0 for k = 3, since each subboard has only two rows. It will be convenient 
to define rp = 1 for all boards. 

Observe next that since B, and B, are disjoint, placing, say, two noncapturing 
rooks on the whole board B can be broken into three cases: placing two 
noncapturing rooks on B, (and none on B,), placing one rook on each subboard, 


Positions 


Letters 


Figure 8.8 
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or placing two noncapturing rooks on B,. Thus we see that 
r{B) = r(By) + r(By)ri(B2) + ro(B2) 
or, using that fact that 7,(B.) = 7(B,) = 1, 
r(B) = r(B))ro(B2) + r(Bi)r\(B2) + ro(B)r(B2) (2) 
=1X1+3X4+1x3=16 
Recall that 16 is the number obtained earlier when summing all N(A;A;)s to 
count all ways to pick two darkened squares each in a different row and column. 


The reasoning leading to Eq. (2) applies to r,(B) for any & and for any 
board B that decomposes into two disjoint subboards B, and B,. 


Lemma 
If B is a board of darkened squares that decomposes into the two disjoint 
subboards B, and B,, then 


r(B) = r(By)ro(B2) + n-(Byri(B2) + ++ + (Bi ri(B2) (3) 


The observant reader may notice that Eq. (3) is very similar to formula (6) 
in Section 6.2 for the coefficient of a product of two generating functions. That 
is, if f(x) = Da,x’ and g(x) = =b,x’, then the coefficient of x* in h(x) = f(x) g(x) 
is aby + a,b + +++ + aod. We will now exploit this similarity. 

We define the rook polynomial R(x, B) of the board B of darkened squares 
to be 

R(x, B) = r(B) + r(B)x + r{B)xeP +++: 


Remember that 7,(B) = 1 for all B. Note that the rook polynomial depends 
only on the darkened squares, not on the size of the original assignment diagram. 
Then for B, and B, as defined above, we found that 


R(x, B,) = 1 + 3x + 1x? and R(x, B.) = 1 + 4x + 3x? 


Moreover, by the correspondence between Eq. (3) and the formula for the 
product of two generating functions, we see that r,(B), the coefficient of x* in 
the rook polynomial R(x, B) of the full board, is simply the coefficient of x‘ 
in the product R(x, B,)R(x, B). That is, 
R(x, B) = R(x, B,) R(x, By) = (1 + 3x + 1x?) + 4x + 3x’) 
=14+(6X1)+0X4]x+[0X1)+6X4+ X 3))x’ 
+[d X 4) + @G X 3)? 4+ (1 X 3)x4* 
= 1+ 7x + 16x? + 13x? + 3x4 


This product relation is true for any such B, B,, and By. 


Theorem 1 


If B is a board of darkened squares that decomposes into the two disjoint 
subboards B, and B, then 


RG, B) = RG, By) RG, Bz) 
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Without meaning to belittle the role of generating functions in Theorem 
1, we should observe that the generating functions were used here solely because 
polynomial multiplication corresponds to the subboard composition rule given 
in Eq. (3). That is, plugging the 7,(B;)s into two polynomials and multiplying 
them together is a more familiar way to organize the computation required by 
Eq. (3). Shortly we will decompose a board B into nondisjoint subboards, and 
rook polynomials will then play a truly essential role. 

Now that R(x, B) has been determined for the board of darkened squares 
in Figure 8.8 and hence in Figure 8.7, we can solve our original problem about 
the permissible arrangements of a, b, c, d, e. Expression (1) for S, can now be 
rewritten 


S, = r(B)S — k)! 
By the inclusion—exclusion formula, the number of permissible arrangements 
is 
N(A,A2A3A4As) = N — S, + Sy — S3 + Sy — Ss 
= 5! —7(B)4! + r,(B)3! — r,(B)2! + r4(B)1! — r5(B)0! 
=5!-7xX4!+16X3!—-13X2!+3xI1!-0x0! 
= 120 — 168 + 96 — 26+3-0=25 
The values for the r,(B)s came from the rook polynomial R(x, B) computed 
above. This rook-polynomial-based variation on the inclusion—exclusion for- 
mula is valid for any arrangement problem with restricted positions. 


Theorem 2 
The number of ways to arrange n distinct objects when there are restricted 
positions is equal to 
ni — r(B)(n — 1)! + r(B)(n — 2)! +--+ + (— 1)‘ (Bn — bk)! 

+ +++ 4+ (-1)'r,(B)0! (3) 
where the 7,(B)s are the coefficients of the rook polynomial R(x, B) for the 
board B of forbidden positions. 


Let us summarize the little theory we have developed. 


1. Given a problem of counting arrangements or matchings with restricted 
positions, display the constraints in an array with darkened squares for 
forbidden positions, as in Figure 8.7. 

2. Try to rearrange the array so that the board B of darkened squares can be 
decomposed into disjoint subboards B, and B). 

3. By inspection, determine the r,(B;)s, the number of ways to place & noncap- 
turing rooks on subboard B;. 

4. Use the r,(B;)s to form the rook polynomials R(x, B,) and R(x, B), and 
multiply R(x, BRC, Bz) to obtain RG, B). 

5. Insert the coefficients r,(B) of R(x, B) in formula (3). 
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Figure 8.9 _ 


Example 1 


How many ways are there to send six different birthday cards, denoted C), C2, 
C;, Cy, Cs, Cs, to three aunts and three uncles, denoted A,, A, A3, U;, U2, Us, 
if aunt A, would not like cards C, and C,; if A, would not like C, or Cs; if A; 
likes all cards; if U,; would not like C,; or Cs; if U, would not like C,; and if 
U; would not like C,? 

The forbidden positions information is displayed in Figure 8.9. We 
rearrange the board by putting together rows with a darkened square in the 
same column, and putting together columns with a darkened square in the same 
row. For example, rows C, and C; both have darkened squares in columns A, 
and U,, and so we put rows C, and C; beside one another and columns A, and 
U, beside one another; similarly for rows C, and C, and columns A, and U). 
We get the rearrangement shown in Figure 8.10. Thus the original board B of 
darkened squares decomposes into the two disjoint subboards, B, in rows C, 
and C;, and B, in rows C2, C4, and C,. Actually B, itself decomposes into two 
disjoint subboards B; and BZ, where By is the single square (C,, U3). By 
inspection we see that 


R(x, B,) = 1 + 4x + 2x? 
R(x, B.) = R(x, B3)R(x, BZ) = (1 + 3x + x?)(1 + x) 
So 


R(x, B) = RG, BYR, Br) 
= (1 + 4x + 2x*)(1 + 3x + x?)(1 + x) 
= 14 8x + 22x? + 25x? + 12x4 + 2x? 


Ag U, Az Ai U2 U3 


EE 
Figure 8.10 “6 (rs a 
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Figure 8.11 


Then the answer to the card mailing problem is 


6 
> (—1)'r,(B)(6 — &)! 


=6!-8x5!+22xX4!-25X3!+12x2!-2x1!+0x0! 
= 720 — 960 + 528 — 150+ 24-—2+0=160 om 


Now let us consider the problem of determining the coefficients of 
R(x, B) when the board B does not decompose into two disjoint subboards. 
Consider the board B shown in Figure 8.11. Let us break the problem of 
determining r,(B) into two cases, depending on whether or not a certain square 
s is one of the squares chosen for the k noncapturing rooks. Let B, be the board 
obtained from B by deleting square s, and let B* be the board obtained from 
B by deleting square s plus all squares in the same row or column as s. If s is 
the square indicated in Figure 8.11, then B, and B* are as shown in Figure 
8.12. If square s is not used, we must place k noncapturing rooks on B.,. If 
square s is used, then we must place k — 1 noncapturing rooks on B*. Hence 
we conclude that 


r(B) = r(By) + r-1(B#) (4) 


Using the generating function methods introduced in Section 7.5 for turning 
a recurrence relation into a generating function, we obtain from Eq. (4) 


R(x, B) = > 7 (B)x* = > (Bt + DS) n(B¥) x! 
k 


k k 
= >, r(B,)x* + x > r,(B*)x!" 
k h 
= R(x, B,) + xR(x, B*) 


Observe that B, and B* both break into disjoint subboards whose rook polynomi- 
als are easily determined by inspection: 


Figure 8.12 
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Ag U, A, U2 U3 


C4 


Figure 8.13 “ 


R(x, By) = (1 + 3x\(1 + 2x) = 14 5x + 6x2 
RG, BS) = (14+ 291 +) = 14 3x4 2x 
R(x, B) = R(x, B,) + xR(x, B*) = (1 + 5x + 6x”) + x(1 + 3x + 2x’) 
= 1+ 6x + 9x? + 2x3 


These results apply to any board B and any square s in B. 


Theorem 3 


Let B be any board of darkened squares. Let s be one of the squares of B, and 
let B, and B* be as defined above. Then 


R(x, B) = R(x, B,) + xR(B*) 


Theorem 3 provides a way to simplify any board’s rook polynomial. If the 
boards B, and B* do not break up into disjoint subboards, we can reapply 
Theorem 3 to B, and B¥. It is important that the square s be chosen to split up 
B as much as possible. 


Example 2 


In Example 1, suppose that the tastes of uncle U,; change and now he would 
not like card C, but would like C,;. The new board of forbidden positions is 
shown in Figure 8.13. The square in the bottom left corner, call it ¢, is still disjoint 
from the other squares, call them board B,. Board B, cannot be decomposed into 
disjoint subboards, and so we must use Theorem 3. The square s that breaks 
up B, most evenly is (C,, U,), indicated in Figure 8.13. 

Boards B, and B¥ shown in Figure 8.14 both decompose into simple disjoint 
subboards: 


RG, B,) = (1 + 3x4 x*)(1 + 3x +47) = 14+ 6x4 11x? + 6x7 4+ x4 
Ri, B¥) = (14+ 2x) + 2x) = 14+ 4x 4+ 4x? 


+ 


Figure 8.14 


8.3 Restricted Positions and Rook Polynomials 345 


Then 
R(x, B}) = R(x, B,) + xR(x, B*) = (1 + 6x + 11x? + 6x? + x4) 
+ x(1 + 4x + 4x?) 
= 1+ 7x + 15x? + 10x? + x4 
and 


R(, B) = RO, BRO, t) 
= (1 + 7x + 15x? + 10x? + x41 + x) 
= 1+ 8x 4+ 22x? 4+ 25x? + I1xt +5 


Now by Theorem 2, the number of ways to send birthday cards is 
6! — 8 X 5! + 22 X 41 - 25 X 31+ 11 X 2!- 1x 1! +0 X O! = 159 


Note that uncle U,’s.change in tastes changed the final rook polynomial only 
slightly: 7,(B) changed from 12 to 11, r;(B) changed from 2 to 1, and the final 
answer changed from 160 to 159. = 


8.3. EXERCISES 


Mb Summary of Exercises The first seven exercises are similar 


to the examples in this section; the next seven exercises develop theory about 
rook polynomials and combinatorial theory based on rook polynomials. 


1. Describe the associated chessboard of darkened squares for finding all 
derangements of 1, 2, 3, 4, 5. 


2. Find the rook polynomial for the following boards: 
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3. 


10. 


Seven dwarfs D,, D., D3, D4, Ds, Ds, D7 each must be assigned to one of 
seven jobs in a mine, J), J2, J3, J,, Js, Js, J7. D,; cannot do jobs J; or J3; 
D, cannot do J; or Js; D, cannot do J; or Js; Ds cannot do J, or J;; Dz 
cannot do J,. D; and D, can do all jobs. How many ways are there to 
assign the dwarfs to different jobs? ‘ 


. A pair of two distinct dice are rolled six times. Suppose none of the ordered 


pairs of values (1, 5), (2, 6), (3, 4), (5, 5), (5, 3), (6, 1), (6, 2) occur. What 
is the probability that all six values on the first die and all six values on 
the second die occur once in the six rolls of the two dice? 


. A computer dating service wants to match four women each with one of 


five men. If woman 1 is incompatible with men 3 and 5; woman 2 is 
incompatible with men 1 and 2; woman 3 is incompatible with man 4; and 
woman 4 is incompatible with men 2 and 4, how many matches of the 
four women are there? 


. Suppose five officials O;, O., O3, Os, Os; are to be assigned five different 


city cars, an Escort, a Lexus, a Nissan, a Taurus, and a Volvo. O, will not 
drive an Escort or a Nissan; O, will not drive a Taurus; O; will not drive 
a Lexus or a Volvo; 0, will not drive a Lexus; and O; will not drive an 
Escort or a Nissan. If a feasible assignment of cars is chosen randomly, 
what is the probability that: 


(a) O; gets the Volvo? 


(b) O, or Os; get the Volvo? (Hint: Model this constraint with an altered 
board.) 


. Calculate the number of words that can be formed by rearranging the letters 


EERRIE so that no letter appears at one of its original positions, for 
example, no E as the first, second or sixth letter. 


. Find the rook polynomial for a full n X n board. 
. An ascent in a permutation is a consecutive pair of the form i, i + 1. The 


ascents in the following permutation are underlined: 12534. 


(a) Design a chessboard to represent a permutation of 1, 2, 3, 4, 5 so that 
a check in entry (i, /) means i is followed immediately by j in the 
permutation. Darken entries so as to exclude all ascents. 


(b) Find the rook polynomial for the darkened squares in part (a). 

(c) Find the number of permutations of 1, 2, 3, 4, 5 with k ascents. (Hint: 
See Theorem 2 in Section 8.2.) 

Let R,m(x) be the rook polynomial for an n X m chessboard (n rows, m 

columns, all squares may have rooks). 

(a) Show that Ram) = Ra-im() + mxRy-1,m-1) 


(b) Show that = Ran (X) = nmR,-1m-1(x). [Hint: Use identity (5) in Section 
5.5.] 


11. 


12. 


13. 


14. 
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Find two different chessboards (not row or column rearrangements of one 

another) that have the same rook polynomial. 

Consider all permutations of 1, 2, ..., in which i appears in neither 

position i nor i + 1 (m notin or 1). Such a permutation is called a menage. 

Let M,(x) be the rook polynomial for the forbidden squares in a menage. 

Let M*(x) be the rook polynomial when n may appear in position 1, and 

let M°(x) be the rook polynomial when both 1 and n may appear in posi- 

tion 1. 

(a) Show that M*(x) = xM#,(x) + M&(x), Mi) = xM0-1(x) + MFO), 
and M,(x) = M#(x) + xM%,(Q). 

(b) Using the initial conditions M#(x) = 1 + x, M °(x) = 1, show by 
induction that 


n _ aol a oo 

M#(x) = ~ & ‘) xt and M°(x) = > a x : xk 
0 k k 

(c) Find M,(x). 


Given an n X m chessboard C,,,, (see Exercise 10) and a board C of 
darkened squares in C,,,, the complement C’' of C in C,,» is the board of 
nondarkened squares. 


(a) Show that 


k . : 
ices any lt IAI os 
nC) = D ( 1) es Game GO) 
(b) Show that R(x, C') = x"R(1/x, C). 


Use Theorem 2 in Section 8.2 to derive a formula for counting arrangements 
when exactly k elements appear in forbidden positions. 


8.4 SUMMARY AND REFERENCE 


Frequently the combinatorial complexities in the counting problems in Chapter 
5 arose from simultaneous constraints such as ‘‘with at least one card in each 
suit.’’ In this chapter, we formed a simple set-theoretic model for such problems 
and solved once and for all the combinatorial logic of this model. The resulting 
formula, the inclusion—exclusion formula, was then applied to various counting 
problems. This formula requires the proper set-theoretic restatement of a prob- 
lem and the solution of some fairly straightforward subproblems, but in return 
the formula eliminates all the worry about logical decomposition (and worry 
about counting some outcomes twice). After having no help in their problem- 
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solving in Chapter 5, readers should find it easy to appreciate fully the power 
of a formula that does much of the reasoning for them. This is what mathematics 
is all about! 

The last section on rook polynomials provides a nice mini-theory about 
organizing the inclusion—exclusion computations in arrangements with 
restricted positions. The inclusion—exclusion formula was obtained by J. Sylves- 
ter about 100 years ago, although in the early eighteenth century the number 
of derangements had been calculated by Montmort and a (noncounting) set 
union and intersection version of the formula was published by De Moivre. Rook 
polynomials were not invented until the mid-twentieth century (see Riordan [1]). 


1. J. Riordan, An Introduction to Combinatorial Analysis, John Wiley & Sons, 
New York, 1958. 


PART THREE 


ADDITIONAL 
TOPICS 


CHAPTER 9 


POLYA’S ENUMERATION 
FORMULA 


9.1 EQUIVALENCE AND SYMMETRY GROUPS 


In this chapter we examine a special class of counting problems. Consider the 
ways of coloring the corners of a square black, white, or red: There are three 
color choices at each of the four corners, giving 3 X 3 X 3 X 3 = 81 different 
colorings. Suppose, however, that the square is not in a fixed position but is 
unoriented, like a square molecule in a liquid. Now how many different corner 
colorings are there? The unoriented figure being colored could be a n-gon or 
a cube, and the edges or faces could be colored instead of the corners. The 
floating square problem is equivalent to finding the number of different (unori- 
ented) necklaces of four beads colored black, white, or red. Polya’s motivation 
in developing his formula for counting distinct colorings of unoriented figures 
came from a problem in chemistry, the enumeration of isomers. 

The difficulty in these problems comes from the geometric symmetries of 
the figure being colored. We develop a special formula, based on this set of 
symmetries, to count all distinct colorings of a figure. With a little more work, 
we also obtain a generating function that gives a pattern inventory of the 
distinct colorings. For example, the pattern inventory of black—white colorings 
of the corners of a cube with all geometric symmetries allowed is 


b§ + bw + 3b5w? + 3b°w? + 7htwt + 3b°w> + 3b°w® + bw? + w8 


where the coefficient of b'w‘ is the number of nonequivalent colorings with i 
black corners and j white corners. 

We develop our formula and associated theory around the sample problem 
of coloring the corners of an unoriented square (floating in three dimensions) 
with black and white. It is impossible to draw an unoriented object; any picture 
shows it in a fixed position. Thus, we start our analysis with the 2* = 16 
black-white colorings of the fixed square. See Figure 9.1. We can partition 
these 16 colorings into subsets of colorings that are equivalent when the square 
is floating. There are six such subsets (see the groupings of colored squares 
shown in Figure 9.1), and so there are six different 2-colorings of the floating 
square. The set of fixed colorings would be too large if a harder sample problem 
were used, such as 2-colorings of a cube or 3-colorings of the square. 

351 | 


352 Chapter 9 Polya’s Enumeration Formula 


O O 
O O 
© O O O O O 
O O O O O O 
O O © O O O O O 
O O O O O O O O 


Figure 9.1 


We seek a theory and formula to explain why there are six such distinct 
2-colorings of the square. Note that the six subsets of equivalent colorings vary 
in size. 

To define the partition of a set into subsets of equivalent elements, we first 
define the general concept of the equivalence of two elements a and b. We 
write this equivalence as a ~ b. The fundamental properties of an equivalence 
relation are: 

(i) Transitivity: a~ b,b~c>a~c. 

(ii) Reflexivity: a ~ a. 
(iii) Symmetry: a~ b> b~ a. 
All other properties of equivalence can be derived from these three. Any binary 
relation with these three properties is called an equivalence relation. Such a 
relation defines a partition into subsets of mutually equivalent elements called 
equivalence classes. 


Example 1 


(a) For a set of people, being the same weight is an equivalence relation; all 
people of a given weight form an equivalence class. 


(b) For a set of numbers, differing by an even number is an equivalence 
relation; the even numbers form one equivalence class and the odd numbers 
the other class. 


(c) For a set of figures, having the same number of corners is an equivalence 
relation; for each n, an equivalence class consists of all n-corner figures. = 


Next we turn our attention to the motions that map the square onto itself (see 
Figure 9.2). These motions are what make the foregoing colorings equivalent to 
another one. Before we develop a theory about symmetries and their relation 
to coloring equivalence, let us take a closer geometric look at the symmetries 
of a square and some other figures. 
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Example 2 


Figure 9.2 displays the set of motions that map the square onto itself, the 
symmetries of the square. How was this set obtained? More generally, what is 
the set of symmetries of an n-gon, for n even? 

The symmetries of the square divide into two classes: rotations—circular 
motions in the plane; and reflections (flips)—-motions using the third dimension. 
The rotations, about the center of the square, are easy to find. Each rotation is 
an integral multiple of the smallest (nonzero) rotation. The rotations are 7) = 
90° rotation, 7; = 180° rotation, 7, = 270° rotation, and 7, = 360° (or 0°) 
rotation. 

Reflections are a little harder to visualize since reflections are motions in 
three dimensions. The reflections are 7; = reflection about the vertical axis, 
75 = reflection about the horizontal axis, 7; = reflection about opposite corners 
a and c, and 7; = reflection about opposite corners b and d. 

In a regular n-gon, the smallest rotation is (360/n)°. Any multiple of this 
(360/n)° rotation is again a rotation, and so there are n rotations in all. There 
are two types of reflections for a regular even-n-gon: flipping about the middles 
of two opposite sides and flipping about two opposite corners. Since there are 
n/2 pairs of opposite sides and n/2 pairs of opposite corners, a regular even n- 
gon will have n/2 + n/2 = n reflections. Summing rotations and reflections, 
we find that a regular even-n-gon has 2n symmetries. 

We leave it as an exercise to show that these 2n symmetries just described 
are distinct. To show that there are at most 2 symmetries of an even n-gon, 
consider a particular corner, call it x, and the edge e incident to x on the 
clockwise side. The position of x and the relative position of e after the action 
of a symmetry totally determine the symmetry (readers should convince them- 
selves of this). A symmetry could map «x to any of the n corners of the n-gon— 
n choices—and e could be mapped to either side of x—2 choices. In total, 
there are 2n possible different symmetries of the even n-gon. = 


Figure 9.2 
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Example 3 


Describe the symmetries of a pentagon, and more generally, of an n-gon for 
odd a, 

As noted in Example 2, any regular n-gon has n rotational symmetries. A 
pentagon will have five rotational symmetries of 0°, 72°, 144°, 216°, and 288°. 
However, the reflections discussed in Example 2 about opposite sides or opposite 
corners do not exist in the pentagon. Instead, we reflect about an axis of 
symmetry running from one corner to the middle of an opposite side (see Figure 
9.3). There are five such reflections, for a total of 10 symmetries. 

In a regular odd n-gon, there are n such flips, along with n rotations. 
Summing rotations and reflections, we find that a regular odd n-gon also has 
2n symmetries. We leave it as an exercise for the reader to show that these 2 
symmetries are all distinct. The same argument used in Example 2 shows that 
there are only these 2” symmetries. = 


Example 4 


Describe the symmetries of a tetrahedron. 

A tetrahedron consists of four equilateral triangles that meet at six edges 
and four corners (see Figure 9.4). Besides the motion leaving all corners fixed, 
call it the 0° motion, we can revolve 120° or 240° about a corner and the center 
of the opposite face (see Figure 9.4a), or we can revolve 180° about the middle 
of opposite edges (see Figure 9.4b). Since there are four pairs of a corner and 
opposite face and three pairs of opposite edges, we have a total of 1 (0° motion) 
+ 4X 2 + 3 = 12 symmetries. It is an exercise to check that these 12 
symmetries are distinct and that no other symmetries exist. = 


The symmetries of a square are naturally characterized by the way they 
permute the corners of the square. Thus, the 180° rotation 7; (see Figure 9.2) 
can be described as the corner permutation: a > c, b > d, c > a, d => b; in 
abcd 
cdab 
> b,b->c,c7>d,d->a,ora>b>c->d—-a. 

A permutation of the form x, > x) > x3... —> x, —> x, is called a cyclic 
permutation or cycle. Thus, 7, is a cycle of length 4. Cycles are usually written 
in the form (%).42%3.. . . X,X). So 7 = (abcd). Any permutation can be expressed 


tabular form, we write ( ) The 90° rotation 7, can be described as: a 


Figure 9.3 Symmetric reflection of a pentagon 
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Aw 


(b) 


Figure 9.4 Symmetric revolutions of 
a tetrahedron 


as a product of disjoint cycles (proof of this claim is an exercise). For example, 
13 = (ac)(bd), 1, = (adcb), and 7, = (a)(bd)(c). The reader should find cycle 
decompositions for the other 7;s. The depiction of a motion as in Figure 9.2, 
with arrows indicating the mapping at each corner, will make it easier to trace 
out cycles in later calculations. 

In permuting the corners, the symmetries create permutations of the color- 
ings of the corners. For example, if C; is the ith square in Figure 9.1, then 77; 
is the following permutation of colorings: 


= & C; C; Cs Cs Ce C; Cs Cy Cio Cy Cio C3 Ci Cis =) 


1 
Ci Ci Cs Cp Cy Cy Cy C5 Cp Nor Cus Cig Cs Cn Cr Ci w 


The point is that while a symmetry 7; is easily visualized by how it moves the 
corners of the square, what we are really interested in is the way 7; takes one 
coloring into another (making them equivalent). Thus, we formally define our 
coloring equivalence as follows: 


colorings C and C’ are equivalent, C ~ C’, 
if there exists a symmetry 7; such that 7(C) = C’ (2) 


The properties of the set G of symmetries that interest us are the ones that 
make the relation C ~ C’ in Eq. (2) an equivalence relation. These properties 
of G are (here 7; - 7; means applying motion 1; followed by motion 1): 


1. Closure: If 7, 7 © G, then 7; - 7; © G; for example, in Figure 9.2, 7, - 
Ts; = 1. 

2. Identity: G contains an identity motion 7, such that m,- 7; = 7; and 7; - 
w = 7; in Figure 9.2, my is 7). 

3. Inverses: For each 7; € G, there exists an inverse in G, denoted 7;', such 
that 7;'- 7; = mand 7; + 7;' = m, for example, in Figure 9.2, 77! = 7m. 


Observe that closure makes our coloring relation ~ satisfy transitivity 
[property (i) of an equivalence relation]. For suppose C ~ C’ and C’ ~ C”. 
Since C ~ C’, there must exist 7; € G such that 7; (C) = C’. Similarly, there 
is a 7; © G such that 7; (C’) = C”. Then by closure, there exists 7 = 7; - 
a; & G with 7(C) = (7; + 7(C) = C”. Thus C ~ C”. Similarly, properties 
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(2) and (3) of the symmetries imply that our coloring relation satisfies properties 
(ii) and (iii) of an equivalence relation, respectively (see Exercise 16). 

A collection G of mathematical objects with a binary operation is called 
a group if it satisfies properties 1, 2, and 3 along with associativity property— 
(7; * 1;) * ™ = 1; ° (1; ° 7). Thus we have the following theorem. 


Theorem 


Let G be a group of permutations of the set S (corners of a square) and T be 
any collection of colorings of S (2-colorings of the corners). Then G induces 
a partition of T into equivalence classes with the relation C ~ C' < some 
am & G takes C to C’. 


Note that S could be any set of objects and T could be any possible collection 
of colorings. The following simple lemma about groups lies at the heart of the 
counting formula developed in the next section. 


Lemma 
For any two permutations 77, 7; in a group G, there exists a unique permutation 
m, = 7; ' + min G such that 7; +m = 7). 
Proof 
First we show that 7; + 7, = 7. Since m = 7; ' - 7, then 
T° 1, = 1° (mt; '+ 1) = (m+ 7;')+ 7; (by associativity) 
= 1° 0; = 7; 
as claimed. Next suppose there also exists a permutation 7; such that 7; - 
a, = m;. Then a; + 7, = 7; + mj. Multiplying the equation by 7;', we have 
Me! (M+ My) = Hj (Mj HE) => (Fm) + m= (E+) + Hy 


SUL =i Smh = + 


9.1 EXERCISES 


Ms Summary of Exercises The first 13 exercises continue the 


examples of equivalence and symmetries given in this section. The remaining 
exercises develop basic aspects of group theory associated with symmetries. 
Prior experience with modern algebra is needed for most of these latter problems. 


1. Which of the following relations are equivalence relations? State your 
reasons. 
(a) = (less than or equal to), for a set of numbers. 
(b) = (equal to), for a set of numbers. 


10. 
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(c) ‘‘Difference is odd,’’ for a set of numbers. 
(d) ‘‘Being blood relations,’’ for a group of people. 
(e) ‘‘Having a common friend,’’ for a group of people. 


. Which of the following collections with given operations are groups? For 


those collections that are groups, which elements are the identities? 

(a) The nonnegative integers 0, 1, 2, ... with addition. 

(b) The integers 0, 1, 2,..., ” — 1 with addition modulo a. 

(c) All polynomials (with integer coefficients) with polynomial addition. 
(d) All nonzero fractions with regular multiplication. 

(e) All invertible 2 X 2 real-valued matrices with matrix multiplication. 


. Find all symmetries of the following figures (indicate with arrows where 


the corners move in each symmetry, as in Figure 9.2): 


(a) (b) “777° © @ > 
¢ b Fi ; d d c 


Write the following symmetries or permutations as a product of cyclic 
permutations: 


(a) 7 (b) 73 (c) 7% (d) 7, 


1234567 
(e) 14° 7; (f) 177° 1%, (g) cern 


. For the symmetries of the square listed in Figure 9.2, give the associated 


permutation of 2-colorings, as in Eq. (1), for: 
(a) 7, (b) 7 (c) 75 (d) 77, 


. (a) List all 2-colorings of the three corners of a triangle. 


(b) For the following symmetries of a triangle, give the associated permuta- 
tion of 2-colorings, as in Eq. (1), for: 
(i) 7 = 120° rotation (ii) 7 = flip about vertical axis. 


. Show that the 8 symmetries of a square listed in Figure 9.2 are all distinct. 
. Show that the 2” symmetries of an n-gon (even or odd) mentioned in 


Examples 2 and 3 are all distinct. 


. Show that the 12 symmetries of a tetrahedron listed in Example 4 are all 


distinct. Show that there are exactly 12 symmetries of a tetrahedron. 


Find the symmetry of the square equal to the following products (remember 
that 7; - 7; means applying motion 7; followed by motion 7;): 


(a) 7° 74 (b) 72° 7s (c) 77° (72° 73) (d) (77+ 75) + 773 
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11, 


(a) Write out the 6 X 6 multiplication table for the product of all pairs 
of symmetries of a triangle. 


(b) Repeat part (a) for integers 1, 2, 3, 4 with multiplication modulo 5. 
(c) Repeat part (a) for the following group of permutations of 1, 2, 3, 4: 


1234)\ (1234) (1234) (1234 
1234) \2143/ \3412/ \4321 


. Find two symmetries of the square 7, 7; such that 7; - 7; # 7° 7; (this 


means the group of symmetries of a square is noncommutative). 


. Many organic compounds consist of a basic structure formed by carbon 


atoms (carbon atoms are the corners of a floating figure), plus submolecular 
groups called radicals that are attached to each carbon atom (the carbons 
are like corners and the radicals like colors). Suppose such an organic 
molecule has six carbon atoms, with four radicals of type A and two radicals 
of type B. Suppose this molecule has three isomers, that is, three different 
ways that the two types of radicals can be distributed. Which of the follow- 
ing two hypothetical carbon structures would have three isomers with four 
As and two Bs? 


. Prove that an equivalence relation partitions a set into disjoint subsets of 


mutually equivalent elements. 


. Give a procedure for decomposing any permutation into a product of cycles. 
. Show that properties (2) and (3) of a group G of permutations imply 


properties (ii) and (iii), respectively, of the associated equivalence relation 
defined in the theorem. 


. Let S be a set and G a group of permutations of §. For any two subsets 


S|, S, of S, define S, ~ S, to mean that for some 7 € G, S,; = 7(S,) (={7(s) 
| s € S,}). Show that ~ is an equivalence relation. 


. Prove that the set of permutations of the 2-colorings of a square [see Eq. 


(1)] forms a group. 


. Prove that for any prime p, the integers 1,2,..., p — 1 with multiplication 


modulo p form a group. 


. A subset of elements in a group G is said to generate the group if all 


elements in G can be obtained as (repeated) products of elements in the 

subset. 

(a) Which of the following subsets generate the group of symmetries of 
a square? 
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(i) 71, 7, 73 (ii) 7, 75 (iii) 73, 76 (iv) 16, 77. 

(b) Show that the group of symmetries of a regular n-gon can be generated 
by a subset of two elements. 

21. If a subset G’ of elements in a group G is itself a group, then G’ is called 

a subgroup of G. 

(a) Show that G’ = {7, 7, 73, 7} and G” = {m,, 77} are subgroups of 
the group G of symmetries of a square. 

(b) Find another 4-element subgroup of G containing 773. 

(c) Find all subgroups of the group G of symmetries of a square. 


22. (a) How many different binary relations on n elements are possible? 
(b) How many symmetric binary relations are possible? 


23. Show that 7; ~ 7; if there exists 7 © G such that 7; = mw 'mm is an 
equivalence relation. 


24. A transposition is a cycle of size 2, that is, a permutation that interchanges 
the positions of just two elements (and leaves all other elements fixed). 
Show by induction that any permutation of n elements can be written as 
a composition of transpositions. 


25. For a given group G of n elements, define the function f, on G as follows: 
for each 7’ € G, f,(m') = 7-7’. 
(a) Show that f, is a one-to-one mapping for any 7 & G. 


(b) If we define f,, “fr, = Sa;-n,» Show that the set {f,} forms a group. 


9.2. BURNSIDE’S THEOREM 


We now develop a theory for counting the number of different (nonequivalent) 
2-colorings of the square. More generally, in a set T of colorings of the corners 
(or edges or faces) of some figure, we seek the number N of equivalence classes 
of T induced by a group G of symmetries of this figure. 

Suppose there is a group of s symmetries acting on the c colorings in T. 
Let E¢ be the equivalence class consisting of C and all colorings C’ equivalent 
to C, that is, all C’ such that for some 7 € G, m7 (C) = C’. If each of the s 
ms takes C to a different coloring 7(C), then Ec would have s colorings. Note 
that the set of 7(C)s includes C since 7(C) = C (m is the identity symmetry). 
If every equivalence class is like this with s colorings, then 


sN = c: (number of symmetries) X (number of equivalence classes) = 
(total number of colorings) 
Solving for N, we have N = c/s. 
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Consider, for example, the c = n! oriented seatings of n people around a 
table. There are s = n cyclic rotations of the seatings, and each equivalence 
class consists of n seatings. Thus, the number of equivalence classes (cyclicly 
nonequivalent seatings) is N = n!/n = (nm — 1)! 

On the other hand, suppose we have a small round table with three positions 
for chairs (each 120° apart), and white and black chairs are available. There 
are 2? = 8 ways to place a white or black chair in each position. There are 
three cyclic rotations of the table possible, 0°, 120°, and 240°. We have c = 8 
“‘colorings’’ and s = 3 symmetries, but the number of equivalence classes 
cannot be N = §, a fraction! 

It is true that the three arrangements of one black and two white chairs (or 
vice versa) form an equivalence class, since 0°, 120°, and 240° rotations move 
the one black chair to different positions. However, an arrangement of three 
black chairs (or three white chairs) forms an equivalence class by itself. See 
Figure 9.5 Any rotation maps this arrangement of 3 black chairs into itself, 
that is, leaves it fixed. 

We need to correct the numerator in the formula N = c/s by adding 
the multiplicities of an arrangement, when several cyclic motions map the 
arrangement to itself instead of to other arrangements. In this way every equiva- 
lence class still will have s members when multiplicities are counted. Since 
two symmetries, besides the 0° symmetry, leave the all-black-chair arrangement 
fixed and similarly for the all white, then counting arrangements in Figure 9.5 
with multiplicities we have 


N=34+14+1+14+1+1+41 +4 3)/3 = 12/3 =4 


The “‘multiplicity’’ correction is even more complicated for 2-colorings 
of our square. Here the size of an equivalence class of colorings may be 1 or 
2 or 4, but never 8 (= the number of symmetries of the square). The first 
problem is that several 7s, besides the identity symmetry 7,, may leave a 
coloring C; fixed—that is, 7(C;) = C;. The other problem is that if C, is another 
coloring in C;’s equivalence class, there may be several 7rs all taking C; to C,. 
For example, the coloring Ci) (see Figure 9.1) is fixed by symmetries 7, 73, 
17, Ms,and is mapped to C,, by symmetries 7, 774, 15, 1. 

By the lemma in Section 9.1, the symmetries 7, 14, 75, 15 taking Ci) to 


Multiplicity 3 


OOOO 


Figure 9.5  Muttiplicity 1 
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C\; can be written in the form 7 = 7, - 7’, where 7’ is a symmetry that leaves 
C\, fixed. For example, 


_,.7. (abcd) _ fabcd\ [abcd 
tee eI badc}) \bcda cbad 
Similarly 7, = m+, %4 = 1° 3, % = + M1. Conversely, given any 7* 
that leaves C,, fixed, 7 - 7* takes Ci) to C,, and so 77) - 7* must be one of To, 
14, 1, ™. Thus there is a 1-1 correspondence between the 7s that take C 190 to 
C,, and the zs that leave C,, fixed. Therefore, to count the colorings in an 
equivalence class E with appropriate multiplicities (i.e., coloring C,, has multi- 
plicity 4 since four different 7s take Cio to C,,), it suffices for each coloring 
in E to sum the number of ts that leave the coloring fixed. 

In the case of the equivalence class consisting of Ci) and C 1, each of Cig 
and C;, have multiplicity 4, so that the size of their equivalence class including 
multiplicities is 4 + 4 = 8 (= s, the number of symmetries), as required. 

In general, when multiplicities are counted, each equivalence class E will 
have s elements. If d(x) denotes the number of 7s that leave the coloring x 
fixed, then S,<¢ f(x) = s. 


Formal proof that See $(x) = s: Let x), x2, ..., Xp be the colorings in 
equivalence class E and let 7, 7, ..., 7, be the group of symmetrices. These 
7s can be divided into m groups, R,, R:, ..., Rn, where R; is the set of 7s that 


map x, to x;. As shown above, the number of 7's mapping x, to x; equals #(x;), 
the number of 7rs that leave x; fixed. Thus, ¥,<- #(x) sums the number of 7s 
in R,, in Ry), ..., and in R,,. But the sum of the Rs is just the total number of 
symmetries, s. 

Summing over all equivalence classes, we obtain the following theorem, 
first proved by Burnside in 1897. 


Theorem (Burnside) 


Let G be a group of permutations of the set S (corners of a square). Let T be 
any collection of colorings of S (2-colorings of the corners) that is closed under 
G. Then the number N of equivalence classes is 


1 


N=qm A(x) 
or 
nee 
N= Gd Ma) (*) 


where |G| is the number of permutations and V(7) is the number of colorings 
in T left fixed by 7. 


By ‘‘closed under G,’’ we mean that for all 7 € G.x © T=> mx) © T. 
This closure property is automatic when T is the set of all corner 2-colorings 
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of the square. In Section 9.4 we need to apply formula (*) to special subsets 
of S, such as the set of colorings with two corners black and two corners white, 
that are closed under G. 

The two sums in the theorem both count all instances of some coloring 
being left fixed by some 7, the first sums over the different colorings, the 
second sums over different 7s. Formula (*) will turn out to be more useful in 
later computations. 

We informally summarize the spirit behind formula (*) as follows. The 
total number c of all colorings of our square is equal to V’(77,), since the identity 
symmetry 77, leaves all colorings fixed. If each of the 8 symmetries mapped a 
coloring C into 8 different colorings, then we would have 8 colorings in 
each equivalence class and hence a total c/8 = V(z,)/|G| equivalence classes. 
However, the colorings in collection {7(C):7 © G} forming an equivalence 
class of colorings of an unoriented square are never all distinct. The terms 
(7) in (*) add the ‘‘multiplicities’’ of repeated colorings so that each equiva- 
lence class has 8 colorings in the sum in (*). 


Example 1 


A baton is painted with equal-sized cylindrical bands. Each band can be painted 
black or white. If the baton is unoriented as when spun in the air, how many 
different 2-colorings of the baton are possible if the baton has 2 bands? 3 
bands? 4 bands? 

The batons with 2 bands, 3 bands, and 4 bands are pictured in Figure 9.6. 


(a) The group of symmetries for the 2-band baton is 7, = and 7, = 


21 
be thought of as a 0° revolution of the baton—z7;, is the identity symmetry— 
and 77, as a 180° revolution of the baton. The set of 2-colorings left fixed 
by 7 is all 2-colorings of the baton. There are 2” = 4 2-colorings, and so 
W(7,) = 4. The set of 2-colorings left fixed by 7 consists of the all-black 
and all-white coloring, and so W(7,) = 2. By Burnside’s Theorem, the 
number of different colorings is 4[W(7,) + W(7,)] = (4 + 2) = 3. 


(b) The symmetry group for the 3-band baton is 7; = € ) and 7, = 


2 
(3 ) In terms of cycles, 7, = (1)(2) and m7, = (12). Physically, 7, can 


Ce In terms of cycles, 7, = (1)(2)(3) and 7, = (13)(2). Again, 7 is a 


0° revolution of the baton and 7 is a 180° revolution of the baton. All 2? 


PARSER 
(b) (c) 


Figure 9.6 = (a) 
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2-colorings are left fixed by 7), and so V(z,) = 23 = 8. The set of 2-colorings 
left fixed by 7, can have any color in the middle band (band 2) and a common 
color in the two end bands, and so V(7,) = 2 X 2 = 4. The number of different 
colorings is 3[V(7,) + W(m)] = #8 + 4) = 6. 


(c) The symmetry group for the 4-band baton is 7, = ( ah and 


TT = yy In terms of cycles, 7, = (1)(2)(3)(4) and 7 = (14)(23). 


Again, 7, is a 0° revolution of the baton and 7, is a 180° revolution of the 
baton. All 2* 2-colorings are left fixed by 7, and so V(7,) = 2* = 16. The set 
of 2-colorings left fixed by 7, have a common color for the end bands and a 
common color for the inner bands, so W(7) = 2 X 2 = 4. The number of 
different colorings is 3[W(7,) + W(7,)] = 4116 + 4) = 10. » 


Example 2 


How many different 3-colorings of the bands of an n-band baton are there if 
the baton is unoriented (as in Example 1)? 

As in Example 1, the symmetries of the baton are a 0° revolution and a 
180° revolution. We apply formula (*) for the group of the 0° and 180° revolution 
acting on an n-band baton with 3 colors. There are 3" colorings of the fixed 
baton and so ‘¥(0°) = 3”. The number of colorings left fixed by a 180° spin 
depends on whether n is even or odd. 

If n is even, each of the n/2 bands on one half of the baton can be any 
color—3”” choices—and then for the coloring to be fixed by a 180° spin, 
each of the symmetrically opposite bands must be the corresponding color. So 
‘P(180°) = 3°” and we have from formula (*): N = 43" + 3"), 

To enumerate batons left fixed by a 180° spin when n is odd, we can use 
any color for the ‘‘odd’’ band in the middle of the baton—3 choices. Each of 
the (n — 1)/2 bands on one side of the middle band can be any color—3"™!)? 
choices—and again the other (n — 1)/2 bands must be colored symmetrically. 
So V((180°) = 3 x 3%)? = 3002 and N = 43" + 37D?) 


Example 3 


Suppose a necklace can be made from beads of three colors, black, white, and 
red. How many different necklaces with n beads are there? 

When the 7 beads are positioned symmetrically about the circle, the beads 
occupy the positions of the corners of a regular n-gon. Thus, our question asks 
for the number of corner colorings of an n-gon using three colors. The answer 
depends on what is meant by ‘‘different.’’ If the beads are not allowed to move 
about the necklace, that is, the n-gon is fixed, the answer is 3 X 3 X -++ X 
3 = 3" (three color choices at each of n corners). A more realistic interpretation 
of our problem would allow the beads to move freely about the circle, that is, 
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the n-gon rotates freely (but in this case flips will not be allowed). We employ 
formula (*) to count the number N of equivalence classes of these 3-colorings 
induced by the rotational symmetries of an n-gon. We shall do the calculations 
for n = 3. A more general technique for larger n is developed in the next 
section. 

There are 3? = 27 3-colorings of a 3-bead necklace, and three rotations 
of 0°, 120°, 240°. The 0° rotation leaves all colorings fixed, and so Y(0°) = 
27. The 120° rotation cannot fix colorings in which some color occurs at only 
one corer. It follows that the 120° rotation fixes just the monochromatic color- 
ings. Thus, ¥(120°) = 3. The 240° rotation is a reverse 120° rotation, and so 
*¥(240°) = 3. By formula (*), we have 


N=5Q7+3+3)=11e 


9.2 EXERCISES 


Mb Summary of Exercises The exercises continue the application 


of Burnside’s Theorem to count colorings introduced in Examples 1, 2, and 3. 
1. How many different n-bead necklaces are there using beads of red, white, 
blue, and green (assume necklaces can rotate but cannot flip over)? 
(a) n = 3 (b) n =4 


2. How many different ways are there to place a diamond, sapphire, or ruby 
at each of the four vertices of this pin? 


3. Fifteen balls are put in a triangular array as shown. How many different 
arrays can be made using balls of three colors if the array is free to rotate? 


4. How many different ways are there to 2-color the 64 squares of an 8 X 8 
chessboard that rotates freely? 


5. A merry-go-round can be built with three different styles of horses. How 
many five-horse merry-go-rounds are there? 
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6. A domino is a thin rectangular piece of wood with two adjacent squares 
on one side (the other side is black). Each square is either blank or has 1, 
2, 3, 4, 5, or 6 dots. 
(a) How many different dominoes are there? 


(b) Check your answer by modeling a domino as an (unordered) subset 
of 2 numbers chosen with repetition for 0, 1, 2, 3, 4, 5, 6. 


7. Two n-digit decimal sequences consisting of digits 0, 1, 6, 8, 9 are vertically 
equivalent if reading one upside down produces the other, that is, 0068 
and 8900. How many different (vertically inequivalent) n-digit (0, 1, 6, 8, 
9) sequences are there? 

8. How many different ways are there to color the five faces of an unoriented 
pyramid (with a square base) using red, white, blue, and yellow? 

9. (a) Find the group of all possible permutations of three objects. 

(b) Find the number of ways to distribute 12 identical balls in three indistin- 
guishable boxes. [Hint: First let boxes be distinct, and then use part 


(a).] 


10. How many ways are there to 3-color the n bands of a baton if adjacent 
bands must have different colors? 


11. How many ways are there to 3-color the corners of a square with rotations 
and reflections allowed if adjacent corners must have different colors? 
12. (a) How many ways are there to distribute 12 red jelly beans to four 
children, a pair of identical female twins and a pair of identical male 
twins? 
(b) Repeat part (a) with the requirement that each child must have at least 
one jelly bean. 


13. (a) Show that for any given i, the subset G; of symmetries of the square 
that leave coloring C; (of Figure 9.1) fixed is a group (a subgroup of 
all symmetries). 


(b) Find G,. 
(ce) Find G. 


——<$+______y 


9.3. THE CYCLE INDEX 


Without further theory, we would find it very difficult to apply Burnside’s 
Theorem to counting different colorings of an unoriented figure. Recall that 
Bumside’s Theorem in Section 9.2 says that the number N of equivalence 
classes in a set T of colorings of a figure with respect to a group of symmetries 
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G is 
1 
N iG > W(7) 

where (7) is the number of colorings in T left fixed by 7. If the set T were 
all 3-colorings of the corners of a 10-gon or a cube, it would seem close to 
impossible to determine (77), the number of colorings left fixed by a symmetry 
a of the figure. We shall show that V(7) can be determined easily from the 
structure of 7. This approach leads to greatly shortened computations. We 
develop the theory for this simplified calculation of V(7) in terms of 2-colorings 
of a square. 

Let us apply Burnside’s Theorem to the 2-colorings of the square. Initially 
we determine the number of 2-colorings left fixed by motion 7; by inspection 
(using Figures 9.1 and 9.2). As we count V(7;) for each 7;, we look for a 
pattern that would enable us to predict mathematically which colorings, and 
thus how many colorings, must be left fixed by each 7;. It is helpful to make 
a table of the 7,s and the colorings that they leave fixed; see columns (i) and 
(ii) in Figure 9.7 [columns (iii) and (iv) are developed later]. 

The 0° rotation 77; leaves each corner fixed and hence it leaves all colorings 
fixed. So V(7;) = 16. The 90° rotation 7, cyclicly permutes corners a, b, c, d. 
Being left fixed by the 90° rotation means that each corner in a coloring has 
the same color after the 90° rotation as it did before. Since 7, takes a to b, 
then a coloring left fixed by 7, must have the same color at a as at b. Similarly, 
such a coloring must have the same color at b as at c, the same color at c as 
at d, and the same at d as at a. Taken together, these conditions imply that 
only the colorings of all white or all black corners, C; and Cj, are left fixed. 
Thus W(7,) = 2. In general, a coloring C will be left fixed by 1 if and only if 
for each corner v, the color of C at v is the same as the color at m(v) so that 
the symmetry leaves the color at 1(v) unchanged. 


(i) (ii) (iii) (iv) 


Motion Colorings Left Cycle Structure Inventory of Colorings 

Ty, Fixed by 1, Representation Left Fixed by 7, 

Tt) 16—all colorings xt (b + wy = 1b + 4b3w + 6b’w? + 4bw? + 1wt 

Tr 2—C\, Cig Xe (b* + w') = 1bt + Iw* 

7 2—C1, Cir Curie 4 (+ wy? = 1b! + 2b’w? + Iw! 

4 2—C), Cis X4 (bt + w') = 1p + tw* 

Ts 2—C), Ce, Cs, Cis xB (bP + wy = 1b* + 2b’w? + 1w* 

LA 2—6,:Cz Cos Cie xB (P+ wy = 1p + 2b’w? + Iw 

W 8—C,, Cr, Ca, Cro XIX (b + wb? + w*) = lb' + 2b’w + 2b'w? + 2bw? + Iwt 
Cu, Cr, Cia, Cro 

Ts 8—C,, Ca, Cs, Cro XiX2 (b + wh + w) = 1bt + 2b'w + 2b'w? + 2bw? + 1w* 
Cu, Crs Ciss Cre 


Figure 9.7 
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Next we consider the 180° rotation 73. Looking at the depiction of 7; in 
Figure 9.2, we see that 7; causes corners a and c to interchange and corners b 
and d to interchange. It follows that a coloring left fixed by 7; must have the 
same color at corners a and c and the same color at b and d (no further conditions 
are needed), With two color choices for a, c and with two color choices for b, 
d, we can construct 2 X 2 = 4 colorings that will be left fixed, namely, C,, 
Cio, Cy, Cis. Hence V7) = 4. 

Symmetry 7, is a rotation of 270° or —90°, and so the symmetry is similar 
to 7. Hence V(7,) = ‘V(a2) = 2. The horizontal rotation 7; interchanges corners 
a, b and interchanges corners c, d. Like the 180° rotation 73, symmetry 7; will 
leave a coloring fixed if corners a and b have the same color—2 choices, white 
or black—and corners c and d have the same color—2 choices. Then like 7, 
we have V(7s) = 2 X 2 = 4 colorings fixed, namely, C,, Cy, Cy, Cis. 

A pattern is becoming clear. The reader should now be able quickly to 
predict that 7, will also leave 2 X 2 = 4 2-colorings of the square fixed, for 
again this symmetry interchanges two pairs of corners. Formally, an interchange 
is a cyclic permutation on two elements. All our enumeration of fixed colorings 
has been based on the fact that if 7; cyclicly permutes a subset of corners (that 
is, the corners form a cycle of 7), then those corners must be the same color 
in any coloring left fixed by 7;. 

As mentioned in Section 9.1, any 7; can be represented as a product of 
disjoint cycles. For example, 7; = (ac)(bd) and 1, = (adcb). For each symmetry, 
we need to get such a cyclic representation and count the number of cycles. 
For future use, let us also classify the cycles by their length. It will prove 
convenient to encode a symmetry’s cycle information in the form of a product 
containing one x, for each cycle of 1 corner in 7;, one x, for each cycle of size 
2, and so forth. This expression is called the cycle structure representation 
of a symmetry. 

The cycle structure representation of 7 and 7, is x, since each consists 
of one 4-cycle: 7, = (abcd) and 7, = (adcb). The cycle structure representation 
for 773, 75, and 7, is xx. or x3, since each consists of two 2-cycles. Column 
(iii) in Figure 9.7 gives the cycle structure representation of each symmetry. 

What about 7? Previously, it sufficed to say that 7, leaves all colorings 
fixed. Now it is time to point out that a corner left fixed by a permutation is 
classified as a 1-cycle. Thus 7; consists of four 1-cycles. Its cycle structure 
representation is then xj. We “‘predict’’ that 7, leaves 2+ = 16 colorings fixed, 
that is, 7, leaves all 2-colorings fixed. 

For any 7;, the number of colorings left fixed will be given by setting each 
x; equal to 2 (or, in general, the number of colors available) in the cycle 
structure representation of 7;, that is, 


W(7) = Daumber of cycles in 7 


Let us apply this theory to 7; and 73. For each of these reflections, the cycle 
structure representation is seen to be xx, (follow the arrows in Figure 9.2) and 
thus for each we can find 2? X 2 = 8 colorings that are left fixed. 
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To obtain the number of different 2-colorings of the floating square with 
Burnside’s Theorem, we sum the numbers in column (ii) of Figure 9.7 and 
divide by 8: 


PUMA PUSH 2H4 +2 HAF ATE HR) = FAB) 


There is a slightly simpler way to get this result. First, algebraically sum the 
cycle structure representations of each symmetry, collecting like terms together, 
and then divide by 8. From column (iii) of Figure 9.7, we obtain Axi + 
2x, + 3x3 + 2x?x,). This expression is called the cycle index P(x), x2, ..., 
x,) for a group G of symmetries. By setting each x; = 2, that is, Pg(2, 2,..., 
2), we get the same answer. (Before, the steps were reversed: We first set 
x; = 2 in each cycle structure representation and then added.) 

Suppose that instead of 2 colors, we had 3 colors. Then the same reasoning 
applies, but now there are 3 choices for the color of the corners in each cycle. 
If a symmetry has k cycles, then it will leave 3* 3-colorings of the square fixed, 
and the number of different 3-colorings will be P¢(3, 3, . . . , 3). More generally, 
for any m, P¢(m, m, ..., m) will be the number of nonequivalent m-colorings 
of an unoriented square. The argument used to derive this coloring counting 
formula with the cycle index is valid for colorings of any set with associated 
symmetries. 


Theorem 


Let S be a nonempty set of elements and G be a group of symmetries of S that 
acts to induce an equivalence relation on the set of m-colorings of S. Then the 
number of nonequivalent m-colorings of S is given by Pg(m, m, ..., m). 


Example 1 


Use this theorem to re-solve the necklace counting problems of Example 3 in 
Section 9.2. 

Recall that beads can rotate freely around the circle but reflections are not 
permitted, that three colors of beads are available, and that the number of 
different 3-colored strings of n beads is equal to the number of 3-colorings of 
a cyclicly unoriented n-gon. For n = 3, the rotations are of 0°, 120°, and 240° 
with cycle structure representations of x}, x3, and x3, respectively. Thus, Pg = 
4(x} + 2x3). The number of 3-colored strings of three beads is P,(3, 3, 3) = 
4(33 + 2 x 3) = 11. More generally, the number of m-colored strings of three 
beads is Pg(m, m, m) = 3(m?> + 2m). 

Let us try a more complicated case: n = 8 (see Figure 9.8). The rotations 
are of 0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315°. The 0° rotation consists 
of eight 1-cycles. The 45° rotation is the cyclic permutation (abcdefgh). The 
90° rotation has the cycle decomposition (aceg)(bdfh). The 135° rotation is the 
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Figure 9.8 


cyclic permutation (adgbehcf). The 180° rotation has the cyclic decomposition 
(ae) (bf) (cg) (dh). The cycle structure representations are thus: 0° rotation, 
x1; 45° rotation, xg; 90° rotation, x}; 135° rotation, xs; and 180° rotation, x3. 
The 225°, 270°, and 315° rotations are reverse rotations of 135°, 90°, 45°, 
respectively, and have the corresponding cycle structure representations. Col- 
lecting terms, we obtain 


Po= scrt + 4x, + 2x3 + x3) 
The number of different m-colored necklaces of eight beads is 
sn! + 4m + 2m? + m*) 
For m = 3, we have 


ght 4X3-4+2% 3434 = (6561 + 12 + 18 + 81) = 8348 


Example 2 


In Example 4 in Section 9.1 we listed the 12 symmetries of the tetrahedron 
(see Figure 9.4 in Section 9.1): the 0° revolution, the eight revolutions of 120° 
and 240° about a corner and the middle of the opposite face, and the three 
revolutions of 180° about the middle of opposite edges. We now use our theorem 
to determine the number of corner 3-colorings of the floating tetrahedron. 
The 0° revolution has the cycle structure representation x4. The 120° revolu- 
tion about corner a and the middle of face bcd has the cyclic decomposition 
(a)(bcd) and its cycle structure representation is x,x;. By symmetry, the other 
120° and 240° revolutions have this same cycle structure representation. The 
180° revolution about the middle of edges ab and cd has the cyclic decomposition 
(ab)(cd) and its cycle structure representation is x3. By symmetry, the other 
180° revolutions have the same cycle structure representation. Thus we have 


Po= s(x + 8x,x3 + 3x3) 
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The number of different corner 3-colorings is 


Po(3,3,3,3) = GIF 8X3X343X39) 


1 
= —(81+72+27)= 
77 (81 +72 + 27) = 15 & 


9.3. EXERCISES 


Mb Summary of Exercises The first 11 exercises count distinct 


colorings of unoriented figures. The remaining problems involve associated 
theory. Note that ‘‘floating’’ means that all possible rotations and reflection 
are allowed. 


1. 


How many ways are there to color the corners of a floating square using 
four different colors? 


. How many ways are there to 4-color the corners of a pentagon that is: 


(a) Distinct with respect to rotations only? 
(b) Distinct with respect to rotations and reflections? 


. How many ways are there to 3-color the corners of a hexagon that is: 


(a) Distinct with respect to rotations only? 
(b) Distinct with respect to rotations and reflections? 


(c) Find two 3-colorings of the hexagon that are different in part (a) but 
equivalent in part (b). 


. How many different n-bead necklaces (cyclicly distinct) can be made from 


three colors of beads when: 
(a) n=7 (b) n =9 (c) n = 10 (d) n= 11 


. Find the number of different m-colorings of the vertices of the following 


floating figures. 


(a) A (b) * (2 ee , 
h d | [> 
e d 
c b g f e 


. (a) Find the cycle index for the group of symmetries of a square in terms 


of permutations of edges, not corners. 


14. 
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(b) How many ways are there to 3-color the edges of a floating square? 

(c) How many ways are there to 3-color both edges and corners of a 
floating square? 

(d) Why does not: (number of floating 3-colorings of corners) (number of 
floating 3-colorings of edges) = (number of floating 3-colorings of 
edges and corners)? Explain. 


. How many ways are there to 3-color the edges of the floating figures in 


Exercise 5? (Hint: The cycle index now is for symmetries of the edges.) 


. (a) Find the number of different n-bead 3-colored necklaces (cyclicly dis- 


tinct) in which each color appears at least once when (i) n = 3, 
(ii) n = 4, Gti) n = 7. 
(b) Repeat part (a) when necklaces may reflect as well as rotate. 


. Find the number of different 2-sided dominoes (two squares of 1 to 6 dots 


or a blank on each side of the domino). 


. (a) Let G be the group of all 4! permutations of 1, 2, 3, 4. Find Pg. 


(b) Use part (a) to find the number of ways to paint four identical marbles 
each one of three colors (check your answer by modeling this problem 
as a selection-with-repetition problem). 


(c) Use part (a) to find the number of ways to put 12 balls chosen from 
three colors into four indistinguishable boxes with 3 balls in each box. 


. (a) Find the number of 2 X 4 chessboards distinct under rotation whose 


squares are colored red or black. 


(b) Suppose that two chessboards are also considered equivalent (aside 
from rotational symmetry) if one can be obtained from the other by 
complementing red and black colors. How many different 2 X 4 chess- 
boards are there? 


. Show that if xix - + + xf is the cycle index for a symmetry 7 of an n-gon 


(expressed in terms of a permutation of the corners), then 1k; + 2k, + 
s+ + mk, =n. 


- In solving for the number of corner 2-colorings of some unoriented figure, 


suppose we are given the cycle index P,;, of the group G* of induced 
permutations of the 2-colorings [as in Eq. (1) in Section 9.1], instead of 
the usual cycle index Pg of the group of symmetries of the figure. What 
integer values should be substituted now for each x; in Pg, to get the 
number of 2-colorings, or will no substitution work? Explain. 


Let S be some set of n objects and G a group of permutations of S. For 
subsets S,, S, of S, define the equivalence relation S; ~ S, if for some 
mt © G, S; = 1(S2) (={7(s) | s € S,}). Show that the number of equivalence 
classes equals P¢(2,2, ...). (There ate 2” subsets of S in all.) 
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15. Find the number of m-colorings of the corners of a p-gon, where p is a 
prime (p > 2), if: 
(a) Only rotations are allowed. 
(b) Rotations and reflections are allowed. 


———=moSr_ i$ 


9.4 POLYA’S FORMULA 


We are now ready to address our ultimate goal of a formula for the pattern 
inventory. Recall that the pattern inventory is a generating function that tells 
how many colorings of an unoriented figure there are using different numbers 
of colors. For example, for black-white colorings of the unoriented square, the 
pattern inventory is 1bt + 1b’w + 2b’w? + lbw? + 1w‘. The coefficients 
in the pattern inventory can be viewed as the results of several Burnside 
Theorem—type counting problems. Recall that Burnside’s Theorem says that 
the number N of equivalence classes in T, a collection of colorings of 5 (where 
S is the corners of a square or the faces of a tetrahedron, etc.), caused by a 
group of symmetries of S is 


N= > Van) (*) 


IG| TG 
In Section 9.3, we saw that 


W(7) =- Qaumber of cycles in 7 


In the case of 2-colorings of the floating square, we divide the colorings in 
Figure 9.1 into sets based on the numbers of black and white corners: 


T= {Ci} 

T, = {C., C3, Ca, Cs} 

T, = {Co, C7, Cs, Co, Cio, Cut 
T; = {Cyn, Cis, Cua, Cis} 

T, = {Cich 


In the pattern inventory, the coefficient of, say, b’w is the number of 
nonequivalent colorings with three blacks and one white. This number can be 
obtained from Eq. (*) if we let the group G of symmetries of the square act 
on just the set 7,. Formula (*) from.Burnside’s Theorem applies to any collection 
of colorings that is closed under G. The term ‘‘closed under G” means that for 
any 7 € G, and any coloring C € T, 7(C) equals another coloring in T. T; is 
closed under G, since any symmetry acting on a coloring with 3 black corners 
and 1 white corner yields another coloring with 3 black corners and 1 white 
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corner. The same is true for the other T,s. Thus, the coefficient of b**w* in the 
pattern inventory is the result of Eq. (*) when T, is the set on which G acts. 

Let us try to solve these five subproblems simultaneously. That is, we list 
in the first row of the table in Figure 9.7 [see column (iv)] the numbers of 2- 
colorings in each T;, left fixed by 7,, then in the second row of the table we 
list the numbers of 2-colorings in each T;, left fixed by 7, then by 73, and so 
forth. Then we total up the first number in each row (the number of 2-colorings 
with 4 blacks) and divide by 8 to get the coefficient of b‘ in the pattern inventory, 
total up the second number in each row and divide by 8 to get the coefficient 
of bw, and so forth. 

Since the action of 7, leaves all Cs fixed, the first row is 1, 4, 6, 4, 1. Let 
us put these data in the same form as the pattern inventory. We write: b‘ + 
4b’w + 6b’w? + 4bw> + w’; this is an inventory of fixed colorings. For 7,, 
the inventory of fixed colorings is an inventory of all colorings. Observe that 
this inventory is simply (b + w)‘ = (b + w)(b + w)(b + w)\(b + w), one (b 
+ w) for each corner. For 7, the inventory is b¢ + w‘. For 73, we find by 
observation that the inventory is b‘ + 2b’w? + w‘. This expression factors into 
(b? + w?), 

Just as we did before when counting the total number of colorings fixed 
by the action of some 7, let us look for a ‘‘pattern’’ in the inventories of fixed 
colorings. Again the key to the ‘‘pattern’’ is the fact that in a coloring left 
fixed by 7, all corners in a cycle of 7 must have the same color. Since 7, has 
one cycle involving all four corners, the possibilities are thus all corners black 
or all corners white; hence the inventory is b* + w‘. The motion 7; has two 
2-cycles (ac) and (bd). Each 2-cycle uses two blacks or two whites in a fixed 
coloring. Hence the inventory of a cycle of size two is b? + w?. The possibilities 
with two such cycles have the inventory (b? + w?)(b? + w?). 

The inventory of fixed colorings for 7; will be a product of factors (b/ + 
w/), one factor for each j-cycle of the 7;. So we need to know the number of 
cycles in 7; of each size. But this is exactly the information encoded in the 
cycle structure representation. Indeed, setting x j = (b/ + w’) in the representa- 
tion yields precisely the inventory of fixed colorings for 77. By this method we 
compute the rest of the inventories of fixed colorings. See Figure 9.7. For 7, 
especially, the inventory should be checked against the list of colorings in 
column (ii). The pattern inventory is obtained by adding together the inventories 
of fixed colorings, collecting like-power terms, and dividing by 8. 

As before in Section 9.3, we get a more compact formula and save some 
computation by first adding together the cycle structure representations and 
dividing by 8, and then setting each x; = (b/ + w/) and doing the polynomial 
algebra all at once. Again the first step in this approach yields the cycle index 
P¢(X1, X2, -.. , X,). Thus by setting x; = (b/ + w’) in Pg, we obtain the pattern 
inventory. 

If three colors, black, white, and green, were permitted, each cycle of size 
J would have an inventory of (b/ + w/ + g/) ina fixed coloring. So we would 
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set x; = (b/ + w’ + g/) in Pg. The preceding argument applies for any number 
of colors and any figure. In greater generality we have the following theorem. 


Theorem (Polya’s Enumeration Formula) 


Let S be a set of elements and G be a group of permutations of S that acts to 
induce an equivalence relation on the colorings of S. The inventory of nonequiv- 
alent colorings of § using two colors is given by the generating function 
P((b + w), (b? + w?), (b? + w3),..., (b* + w'). The inventory using colors 
Cis C3 se Ey IS 


Po Sc. S.ch.--- Set] 
=” Fl j=l 

For a moment, let us return to the problem of counting the rota! number 
of 2-colorings left fixed. This number is simply the sum of the coefficients in 
the pattern inventory. To sum coefficients, we set the indeterminants, b and w 
(and hence their powers), equal to 1, or, equivalently, set x; = 2 in Pg. If m 
colors were aliowed, we would set x; = m in Pg, obtaining the same formula 
as in the theorem in Section 9.3. 

As in many other generating function problems, the actual expansion of 
the generating function for a pattern inventory can be quite tedious. We are 
expanding expressions of the form (ci + ch +--+ + c!,)’. When m and r get 
large, it is time to turn to computer algebra software. 


Example I 


Determine the pattern inventory for 3-bead necklaces distinct under rotations 
using black and white beads. Repeat using black, white, and red beads. 
From Example 1 of Section 9.3, we know Pg = 3(xj + 2x3). Substituting 
= (b/ + w!), we get 


AG + wy + 2(b% + w)] = : [(b° + 3b°w + 3bw? + w*) + (2b° + 2w’)] 


= 5 (30° + 3b°w + 3bw? + 3w?) 


=b>+ bw bw? +w 


This result could be obtained empirically. There is only one way to color 
all beads black or all white. If one bead is white and the others black, then by 
rotation the white bead can occur anywhere; thus there is only one necklace 
with one white and two blacks. The same is true by symmetry for a necklace 
with one black and two whites. 

Now consider 3-bead necklaces using three colors. We substitute x; = 
(bi + wi + r/), in Pg to obtain 4[(b + w + ryY + 2(b3 + w? + 73). Instead 
of expanding this expression, we use indirect means. Perhaps again each inven- 
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tory term has coefficient 1. There is a general test for whether all inventory 
coefficients are 1: Compare N*, the number of terms in the pattern inventory, 
with N, the total number of patterns [i.e., Pg(m, m, ..., m)]. Since N equals 
the sum of the coefficients in the inventory, then N* = N if and only if each 
term has coefficient 1. The number N* of terms in the pattern inventory when 
n elements (corner, beads, etc.) are colored with m colors is C(n + m — 1, n) 
(see Exercise 19). For the case at hand, m = 3, n = 3, and so N* = 
(° +3-1 

3 
Since N* # N, we know that all terms do not have coefficient 1. 

On the other hand, the only way for N* = N + 1 is that nine terms in the 
inventory must have coefficient 1 and one term coefficient 2. But as argued 
above, there is only one necklace with all beads the same color and only one 
necklace with one bead of color A and the other two beads of color B. The 
only other possibility for a 3-bead necklace using three colors is to have one 
bead of each color. Thus, there must be two necklaces with one bead of each 
color (the necklaces are any cyclic order of the three colors and the reverse 
cyclic order), and the pattern inventory for black, white and red 3-bead neck- 
laces is 


P+wt+rP+ wt br t+ wb + wr + 2b + rw + 2bwr 


) = 10. From Example 1 of Section 9.3, we know that N = 11. 


Example 2 


Find the number of 7-bead necklaces distinct under rotations using three black 
and four white beads. 

We need to determine the coefficient of b*w‘ in the pattern inventory. Each 
rotation, except the 0° rotation, is a cyclic permutation when the number of 
beads is a prime [see Exercise 13(a)], so Pg = 7(x/ + 6x7). The pattern inventory 
is [(b + w)' + 6(b’ + w’)]. 

Let us again use empirical methods to simplify the expansion of the pattern 
inventory. Since the factor 6(b’ + w’) in the pattern inventory contributes 
nothing to the b*w‘ term, we can neglect it. Thus the number of 3-black, 4- 
white necklaces is simply 


: [(coefficient of b’w* in (b + w)'] = i(7) 2 


Example 3 


Find the pattern inventory of black—white edge colorings of a tetrahedron. 
Although we calculated the cycle index for corner symmetries of the tetrahe- 
dron in Example 2 of Section 9.3, we need a different cycle index for edge 
symmetries. Since the set of objects to be colored is the six edges, we need to 
consider the symmetries of the tetrahedron as permutations of the edges. 
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The 0° revolution clearly leaves all edges fixed and thus has cycle structure 
representation x’. The 120° (or 240°) revolution about a corner and the middle 
of the opposite face cyclicly permutes the edges incident to that corner and 
cyclicly permutes the edges bounding the opposite face (see Figure 9.4 in 
Section 9.1). Thus, the 120° revolution has cycle structure representation 1x3. 

The 180° revolution about opposite edges leaves those two edges fixed (see 
Figure 9.4). Since two applications of a 180° revolution return the tetrahedron to 
its original position, the other 4 edges not left fixed must be in 2-cycles. Thus, 
the 180° revolution has cycle structure representation x?x3. Then Pg = 
zs (x9 + 8x3 + 3x7x3). 

Substituting x; = (b/ + w’), we get 


3 [(b + w)’ + 8063 + wP + 306 + w)(b? + w?)?] 


= [(b° + 6b>w + 15b4w? + 20b3w? + 15b2w4 + 6bws + w) + 
(8b° + 16b°w? + 8w*) + (3b° + 6b>w + Ob4w? + 12b3w) + 9b2w4 + 6bw5 + 3w%)] 
= = (12b° + 12b5w + 24b4w? + 48b3w? + 24b?w4 + 12bw> + 12w) 


= 6° + bow + 2b4w? + 4b3w? + 2b*w4 + bw? + w® 1: 


Example 4 


Find the pattern inventory for corner 2-colorings of a floating cube. 

The symmetries of the cube involve revolutions about opposite faces, about 
opposite edges, and about opposite corners. First, of course, there is the (fixed) 
0° motion, with cycle structure representation x°. 


(a) Opposite faces: As a concrete example, we revolve about the center point 
of the pair of opposite faces abcd and efgh; see Figure 9.9a. A 90° revolution 
yields the permutation (abcd)(efgh) with cycle structure representation 
xi. A 270° revolution has the same structure. A 180° revolution yields the 


(5) 
Figure 9.9 Revolutions of the cube. (a) Revolution about opposite faces. (b) 
Revolution about opposite edges. (c) Revolution about opposite corners. 
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permutation (ac)(bd)(eg)(fh) with cycle structure representation x3. There 
are three pairs of opposite faces and so the total contribution to the cycle 
index of opposite-face revolutions is 6xj + 3x3. 


(b 


— 


Opposite edges: As a concrete example, we revolve about the middle of 
opposite edges ab and gh; see Figure 9.9b. A 180° revolution yields the 
permutation (ad)(bc)(eh)(fg) with cycle structure representation x}. There 
are six pairs of opposite edges, and so the total contribution of opposite- 
edge revolutions is 6x3. 


(c) Opposite corners: As a concrete example, we revolve about the opposite 
corners a and g; see Figure 9.9c. A 120° revolution yields the permutation 
(a)(bde)(chf)(g) with cycle structure representation xjx}. One way to see 
what this permutation does is by noting that the three corners b, d, e adjacent 
to a must be cyclically permuted in any motion that leaves a fixed (and 
similarly for the corners adjacent to g). A 240° revolution has the same 
structure. There are four pairs of opposite corners and so the contribution 
of opposite-corner revolutions is 8x7x}. 


We leave it to the reader (see Exercise 17) to verify that we have enumerated 
all symmetries of the cube and that these symmetries are all distinct. 

Collecting terms, we find Pg = g(x} + 6xj + 9x3 + 8xjx}). The pattern 
inventory for corner colorings of the cube using black and white is thus 


mn [cb + w)? + 6(bY + wh? + 9(b? + wy! + 8D + WHE? + wi) 


As in previous examples, the coefficients of the terms 5°, b’w, bw’, and 
w® are readily seen to be 1. The b°w’, b°w’, and b‘w* terms in the four factors 
of the generating function are (--- + 28b°w? + 56b°w? + 70b‘w* + ---), 
6(- + + 2b4wt + +++), 900+ + 4bow? + 6b4wt + +--+), and 8(-+ + + bow? 
+ 2b>w3 + 4b4w4 + + + +), respectively. Summing and dividing by 24 we have 
g++ + 72b°w? + 72b°w? + 168b4w4 + +: +) = 3b%w? + 3b°w? + Th‘. (It 
is easy to detect errors in these calculations, since most errors will result in a 
noninteger coefficient.) By symmetry, we fill out the pattern inventory to obtain 


b§ + b’w + 3b°w? + 3b5w? + Tb4wt + 3b3w? + 3b°we + bw? + we o 


9.4 EXERCISES 


Mb Summary of Exercises The first 16 exercises use Polya’s enu- 


meration formula, and the remaining problems involve associated theory. Note 
that ‘‘floating’’ means that all possible rotations and reflections are allowed. 


1. Find the pattern inventory for black and white corner colorings of a floating 
pentagon. 
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. Find an expression for the pattern inventory for black—white, n-bead neck- 


laces (rotations only) and find the number of necklaces with 3 white beads 
and the rest black: 


(a) n =6 (b) n=9 (ce) n= 10 (d) n= 11 


. Find the pattern inventory for black, white, and red corner colorings of a 


floating square. 


. Find an expression for the pattern inventory for the 2-colorings (rotations 


only) of the 16 squares in a 4 X 4 chessboard. 


. Find an expression for the pattern inventory for black—white corner color- 


ings of the following floating figures: 
(a) (b) (c) 


. (a) Find the pattern inventory for corner 2-colorings of a floating pyramid 


(with a square base). 
(b) Repeat part (a) for edge colorings. 
(c) Repeat part (a) for face colorings. 


. Find an expression for the pattern inventory for edge 2-colorings of the 


floating figures in parts (a) and (b) in Exercise 5. 


. Find an expression for the pattern inventory for edge 2-colorings of a 


floating cube and find the number of edge 2-colorings with 3 white and 9 
black edges. 


. Find an expression for the pattern inventory for face 2-colorings of: 


(a) A floating tetrahedron. (b) A floating cube. 


. (a) Find the pattern inventory for the corner 3-colorings of a floating 


pentagon with adjacent corners different colors. 
(b) Repeat part (a) for a floating tetrahedron. 
(c) Repeat part (a) for a floating cube. 


. Find an expression for the pattern inventory for black—white colorings of 


four indistinguishable balls (see Exercise 10 in Section 9.3). 


. Give an empirical argument (without use of the cycle index) to show that 


there are 5 different n-bead necklaces with 2 white beads and n — 2 


black beads ( Lr] is the largest integer <r). 


. Given that p is a prime and p-bead necklaces are made of black and white 


beads. 
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(a) Show that each rotation except 0° is a cyclic permutation of the corners. 
(b) What is the number of such necklaces with exactly k white beads? 


14. (a) Suppose that instead of coloring the corners of a floating square, we 
attach 0 or 1 or 2 identical jelly beans at a corner. Find a generating 
function, and expand it, for the number of different squares with a total 
of k beans at its corners. (Hint: In the inventory of patterns left fixed, 
the exponents within a factor will vary.) 


(b) Repeat part (a) with each corner getting a red jelly bean or a white 
jelly bean or a red and a white jelly bean. Now find a generating 
function for the number of squares with j whites and k reds. 


15. Suppose n batons with small holes drilled through their midpoints are 
strung along a piece of wire and each end of each baton is painted one of 
three possible colors (red, white, blue). The wire is fixed but the batons 
revolve about their centers. How many indistinguishable configurations are 
there for: 


(a) n=2 (b) n = 3. 


16. How many distinct (nonisomorphic) unlabeled graphs are there with four 

vertices? (Hint: Each possible edge has two possible colors: ‘‘edge present’’ 
“and ‘‘edge absent.’’) 

17. Show that the 24 symmetries of a cube listed in Example 4 are all distinct 
and include all symmetries of a cube. 

18. Characterize the geometric figures that when floating have only one coloring 
with one black corner and the others white. 

19. Show that there are C(n + m — 1, n) different terms in the pattern inventory 
for m-colorings of corners of an n-gon. 

20. Let a;,,, denote the minimum number of coefficients in the pattern inventory 
of all k-colorings of the corners of an n-gon needed so that using symmetry 
all other coefficients in the pattern inventory are known. Find a generating 
function g,(x) for a;,. 


—t-_———_-- 


9.5 SUMMARY AND REFERENCES 


Polya’s enumeration formula is important, practically because it solves 
important problems, mathematically because it is an elegant marriage of group 
theory. and generating functions, and pedagogically because it is the only truly 
powerful combinatorial formula we will see in this book (where a difficult 
problem can be solved by ‘‘plugging’’ the right numbers into a specialized 
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formula). Of equal importance is the manner in which Polya’s formula has 
been developed: Sections 9.3 and 9.4 could be read as a case study in the 
experimental derivation of a mathematical theory. (This approach follows the 
pedagogical style Polya used when he taught this material.) Students interested 
in a more rigorous development of this theory and its extensions are referred 
to [2], which includes an English translation of Polya’s original 1937 paper, 
‘*Kombinatorische Anzahlbestimmungen fiir Gruppen, Graphen und chemische 
Verbindungen.’’ 

In a more formal development, one defines a coloring as a function f from 
a set S (of corners) to a set R (of colors). The function /;, corresponding to C, 
abcd 
ww bb 
composition 7 - f of a motion and a color function to be a new color function 
f'; for example, 7, - 6 maps a—> b> w,b>c—>b,c>d—>b,d>a-> 


WwW, Or 7, ° fy = a = f;. The coloring permutation induced by a sym- 


(in Figure 9.1), would be written in tabular form as ( ) We define the 


metry 7 maps each f to 7 - f. Although tedious, a formal calculation of the 
coloring permutation is thus possible (before we did it by inspection). We 
define coloring equivalence by: f ~ f’ if and only if there exists 7 such that 
a: f = f'. Our theory can readily be restated in terms of this new definition. 

Polya’s enumeration formula has an important application to another field 
of combinatorial mathematics. It is used to enumerate families of graphs (see 
Exercise 16 in Section 9.4). This application was pioneered by F. Harary; see 
Harary and Palmer [1]. 


1. F. Harary and E. Palmer, Graphical Enumeration, Academic Press, New 
York, 1973. 


2. G. Polya and R.C. Reade, Combinatorial Enumeration of Groups, Graphs, 
and Chemical Compounds, Springer-Verlag, New York, 1987. 


CHAPTER 10 
GAMES WITH GRAPHS 


10.1 GRAPH MODEL FOR INSTANT INSANITY 


In this chapter we develop several useful graph concepts and theorems in the 
process of analyzing how to play some games. While clearly not an important 
use of graphs, our graph-theoretic analysis of games yields some interesting 
results. This chapter is meant as a sort of mathematical ‘‘dessert’’ following 
the more serious topics in preceding chapters. It is essential to study how graph 
models can be used to solve a variety of real-world problems, but there is a 
more personal reward in learning how graphs can permit one to solve a well- 
known difficult puzzle or to win at certain games. The first section is devoted 
to an ingenious graph model for the puzzle known as Instant Insanity. In the 
succeeding two sections, we develop the theory of progressively finite games 
and apply it to Nim-type games. 

The Instant Insanity puzzle consists of four cubes whose faces are colored 
with one of the four colors: red (R), white (W), blue (B), and green (G). The 
six faces on the ith cube are denoted: f—front face, /—left face, b;—back 
face, r;—right face, top face, and u,;—under face. In Figure 10.1, the first 
cube is colored: /, = B, r, = W,f, = R, b; = R, t, = G, u, = B. The objective 
of this puzzle is to place the four cubes in a pile (cube 1 on top of cube 2 on 
top of ... etc.) so that each side of the pile has one face of each color. For 
example, Figure 10.2 shows one solution for the pile of cubes given in 
Figure 10.1. 

We shall work with the four cubes shown in Figure 10.1. Determining how 
to arrange these four cubes in a pile that is an Instant Insanity solution is a 
very difficult task. Observe that there are 24 symmetries of a cube, and, thus 
24* = 331,776 different piles that can be built. An enumeration tree search 
will be immense, although symmetries of the face colors and the constraint of 
no repeated colors on a side will eliminate many possibilities. A computer 
program to do this search is easily written but may require a lot of computer 
time. Fortunately, we can model this puzzle with a 4-vertex graph in such a 
fashion that the graph-theoretic restatement of the puzzle can be solved by 
inspection in a few minutes. 

Before presenting the graph model, we need to discuss a simple decomposi- 
tion principle for this puzzle. Arranging the cubes in a pile so that left and 
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Figure 10.1 Cube 1 


right sides of the pile have one face of each color is ‘‘independent’’ of arranging 
the front and back sides with one face of each color. By ‘‘independent’’ we 
mean that once cube i is arranged so that a given pair of opposite faces are on 
the left and right side of the pile, then any remaining pair of opposite faces on 
cube i can be on the front and back sides of the pile. 

Let /*, r*, f*, b* denote the colors of the four respective faces of cube 
i visible on the four sides of the pile when cube i is reoriented to obtain an 
Instant Insanity solution. Suppose that /* = t; = G and r* = u, = B, that is, 
the top face ¢, of cube 1 in Figure 10.1, which is green, is reoriented to be the 
left side /*, and the under face u,, which is blue, is reoriented to be the right 
side r*. Then by rotating cube 1 about the centers of these two faces, we can 
get f* = B, b* = W, or f* = R, b* = R, or f* = W, b* = B—all possible 
remaining choices for front and back sides. Since this same left-right and 
front—back ‘‘independence’’ holds for the other cubes, we see that the puzzle 
can be broken into two disjoint problems. 


Decomposition Principle 


1. Pick one pair of opposite faces on each cube for the left and right sides 
of the pile so that these two sides of the pile will have one face of each 
color; and 


2. Pick a different pair of opposite faces on each cube for the front and back 
sides of the pile so that these two sides will have one face of each. 


Figure 10.2 
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Now we are ready to present our graph model (due to F. de Carteblanche). 
Actually we use a multigraph (in which multiple edges and loops are allowed). 
Make one vertex for each of the four colors. For each pair of opposite faces 
on cube i, create an edge with label i joining the two vertices representing the 
colors of these two opposite faces. For opposite faces 1, = B, r, = W on cube 
1, we draw an edge labeled 1 between vertex B and vertex W; for f, = R, >, 
= R, we draw a loop labeled 1 at vertex R; and for t, = G, u, = B, we draw 
an edge labeled 1 between vertices G and B. The edges for the other cubes are 
drawn similarly. Figure 10.3 shows this multigraph for the cubes in 
Figure 10.1. 

We can now restate the puzzle in graph-theoretic terms. By the decomposi- 
tion principle, we can break the puzzle into a left-right part and a front—back 
part. We initially consider just the left-right part, that is, find one pair of 
opposite faces on each cube so that the left and right sides of the pile have 
one face of each color. 

Let us simplify this left-right problem slightly by asking only for a set of 
four opposite-face pairs, one pair from each cube, such that among this total 
set of eight faces each color appears twice. Later we will show how to ensure 
that each color appears once on the left side and once on the right side. Since 
a color corresponds to a vertex, a cube to an edge number, and a pair of opposite 
faces to an edge, this simplified left-right problem has the following graph 
theoretic restatement: find four edges, one with each number, such that the 
family of eight end vertices of these four edges contains each vertex twice. 

This condition on the end vertices of the four edges is equivalent to requiring 
that the subgraph formed by these four edges has each vertex of degree 2 (a 
self-loop counts as degree 2 at its vertex). Note that a subgraph with all vertices 
of degree 2 is just a circuit or collection of disjoint circuits (a self-loop is a 
circuit of length 1). In a general n-vertex graph, a set of n edges forming 
disjoint simple circuits is called a factor of the graph. In our Instant Insanity 
model, let the term labeled factor denote a factor in which each edge number 
appears once. In Figure 10.4 we show three possible labeled factors for the 
multigraph in Figure 10.3. 

The simplified left-right problem in our Instant Insanity graph model 
reduces to: Find a labeled factor. 


Figure 10.3 
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@ @D 
(b) 
Figure 10.4 


We next show how a labeled factor can be transformed into an arrangement 
of the cubes in which the left and right sides of the pile have one face of each 
color. We direct the edges in each circuit in a consistent direction, say, in the 
clockwise direction (see Figure 10.4). Consider the labeled factor in Figure 
10.4a. As we go around this circuit, we arrange each cube with the color of 
the tail vertex of an edge as the color on the left side of the cube, and the color 
of the head vertex on the right side. One can start with any edge on the circuit; 
we pick the edge labeled 1. This 1-edge in Figure 10.4a goes from B to G, 
and so we arrange cube | so that /* (the left face of cube | in the pile) = B 
and r* = G (see Figure 10.5). Following the 1-edge on the circuit (in the 
clockwise orientation of the circuit), we next encounter a 4-edge from G to R. 
Accordingly we arrange cube 4 so that /* = G and rj = R. Next comes a 2- 
edge from R to W followed by a 3-edge from W to B. So we arrange cubes 2 
and 3 with /* = R, r* = W and /* = W, r¥ = B. Figure 10.5 shows the left 
and right sides (only) of the pile made by the four cubes arranged as just 
described. 

This process assures that each color appears once on each side, since each 
vertex (color) is at the head of one edge and at the tail of one edge. If we had 


a! (| cube 1 


ae W Cube 2 
] 


Cube 3 


Cube 4 


Figure 10.5 
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chosen the labeled factor in Figure 10.46, we would have used the clockwise 
traversal procedure for all three circuits, yielding /* = B, r* = G,/1* = G, 
r¢ = B for one circuit, and /* = r¥ = R and /¥ = r¥ = W for the two self- 
loops. 

We are now ready to solve the Instant Insanity puzzle. Restating the decom- 
position principle in terms of labeled factors: 


Graph-theoretic Formulation of Instant Insanity 


1. Find two edge-disjoint labeled factors in the graph of the Instant 
Insanity puzzle, one for left—right sides and one for front—back 
sides; and then 


Use the clockwise traversal procedure to determine the left-right 
and front—back arrangements of each cube. 


It is not hard to find two disjoint labeled factors by inspection. The three 
labeled factors in Figure 10.3 all use the same 1-edge (between B and G). So 
no two of these three factors are disjoint. The easiest approach is to find one 
labeled factor, delete its edges, and look for a second labeled factor. If none 
is found, start with a different labeled factor. For example, suppose we use the 
labeled factor in Figure 10.4a as our first factor. After deleting its edges from 
the graph in Figure 10.3, we easily find a second labeled factor. Such a second 
labeled factor is shown in Figure 10.6 (the reader should be able to find a 
second factor). 

We use the factor from Figure 10.4a to arrange the left and right sides, as 
shown in Figure 10.5. Next we rotate each cube about the centers of its left 
and right faces to arrange the front and back faces according to a clockwise 
traversal of the circuits in Figure 10.6. Starting with the 2-edge, we make f* 
= W, by = B, f¥ = B, b¥ = G, fi = G, bi = W and f# = b# = R. Figure 
10.7 shows the resulting solution of the Instant Insanity puzzle. Now go buy 
or borrow a set of Instant Insanity cubes and show your friends that you learned 
something really useful from this book! 


Figure 10.6 ©) 
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10.1 EXERCISES 


ZZ 1. Find all other labeled factors of the multigraph in Figure 10.3 besides the 


ones in Figure 10.4 and Figure 10.6. Give an argument in the process to 
show that there can be no other labeled factors. 


2. Find all Instant Insanity solutions (a disjoint pair of labeled factors) to the 
game with associated graph (a). 


3. Find all Instant Insanity solutions to the game with the associated graph 
(b) shown above. 


4. Show carefully that Instant Insanity graph (c) shown above does not possess 
a solution, that is, a pair of disjoint labeled factors. 


5. (a) If a graph G has a Hamiltonian circuit, must it also have a factor? 
Prove true or give a counterexample. 


(b) Repeat part (a) for the case of G having an Euler cycle. 


6. Find a factor in the following graphs, if possible: 
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(c) b 


7. Show that the definition of a factor in a graph implies that each vertex is 
incident to exactly two edges (or one self-loop) in a factor. 


8. (Problem based on material in Chapter 4.) A directed factor in an n-vertex 
directed graph is a set of n edges with one of these edges incident from 
and one incident to each vertex. Show how the problem of finding a directed 
factor in a directed graph can be formulated as a matching or network flow 
problem. 


—______y 


10.2.» PROGRESSIVELY FINITE GAMES 


In this section we develop the mathematical theory of progressively finite 
games. A game in which two players take turns making a move until one player 
wins (no ties are allowed) is called progressively finite if: (1) there is a finite 
choice of moves possible at any particular position in the play of the game; 
(2) the play of the game must end after a finite number of moves; and (3) there 
is a finite number of possible starting positions. 

Our objective in this section is winning: how to determine winning strategies 
for progressively finite games. We can model a progressively finite game by 
a directed graph with a vertex for each position that can occur in the play of 
the game and a directed edge for each possible move from one position to 
another.* 

Observe that the graph of a progressively finite game cannot contain any 
directed circuits, since players could move around and around a circuit of 
positions forever (violating the constraint on finite play). Thus, no positions 


* There is one technical point here. This book’s definition of a graph requires that it have a finite number 
of vertices, but the definition of progressively finite games does not explicitly require that there be a 
finite number of positions; it requires only a finite number of moves from any position, finite starting 
positions and finite play. Fortunately, it can be proved that a progressively finite game has in total only 
a finite number of different possible positions. This result is known as Konig’s Infinity’s Lemma. 
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can ever be repeated in the play of a game. Rather, the game moves inexorably 
toward some final position that is a win for one of the players. Games such as 
checkers and chess that permit ties and repetition of positions are not progres- 
sively finite. Most progressively finite games are ‘‘takeaway’’-type games of 
the sort illustrated by the games in Examples 1 and 2. 


Example 1 


A set of 16 objects is placed on a table. Two players take turns removing 1, 
2, 3, or 4 objects. The winner is the player who removes the last object. The 
graph of this game is shown in Figure 10.8 (all edges are directed from left to 
right). = 


Example 2 


Starting with an empty pile, two players add 1 penny or 2 pennies or 1 nickel 
to the pile until the value of the pile is the square of a positive integer = 2 or 
until the value exceeds 40. The player whose addition brings the value of the 
pile to one of these critical amounts takes all the money in the pile. The graph 
of this game is shown in Figure 10.9 (all edges are directed from left to right; 
arrows point to the game-winning positions). This game is an inverted form 
of a ‘‘takeaway’’ game in which 1, 2, or 5 objects are withdrawn from a pile 
until the pile is reduced to certain critical sizes. m= 


The reader is encouraged to try playing these games with a friend. A 
winning strategy for the first player (who makes the first move) in the game 
in Example 1 can be found with a little thought. The game in Example 2 is 
harder. The details of this second game only serve to confuse systematic attempis 
to find a winning strategy. It is easier to develop a general theory of winning 
strategies in progressively finite games and then apply the theory to Exam- 
ple 2. 

A winning position in a progressively finite game is a position at which 
play stops and the player who moved to this position is declared the winner. 

In the graph of a progressively finite game, vertices with 0 out-degree must 
represent winning positions, for if no edges leave a vertex, the game must stop 
at that vertex. We call such vertices winning vertices. The graph of the game 
in Example 1 has just one winning vertex, numbered 0, while the graph of the 
game in Example 2 has six winning vertices, numbered 4, 9, 16, 25, 36, and 
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‘over 40.’’ Every progressively finite game must have at least one winning 
position or else play would go on without end. 

A winning strategy for a player is a rule that tells the player which move 
to make at each stage in a game so as to ensure that the player will eventually 
win, that is, finally move to a winning position. Obviously only one player can 
have a winning strategy. 

Any vertex adjacent to a winning vertex is a “‘losing’’ vertex in the sense 
that if one player moves to a ‘‘losing’’ vertex, the other player can then move 
to a winning vertex (and win the game). The vertices numbered 1, 2, 3, and 4 
are losing vertices in the game in Example 1. The vertices numbered 23.3515 
8, 11, 14, 15, 20, 23, 24, 31, 34, 35, 37, 38, 39, and 40 are losing vertices in 
the game in Example 2. 

Stepping one more move from the end of the game, we see that if all edges 
from a vertex x go to losing vertices, then x is a ‘‘prewinning’’ vertex. Whenever 
player A moves to such a prewinning vertex, then player Bs next move must 
be to a losing vertex and now player A can move to a winning vertex. Vertex 
5 in the game in Example 1 is a prewinning vertex, since all edges from 5 go 
to losing vertices (1, 2, 3, and 4). Vertices 6 and 33 are the prewinning vertices 
in the game in Example 2; all other vertices adjacent to losing vertices are also 
adjacent to nonlosing vertices. 

The general theory of winning strategies in progressively finite games is 
based on a recursive extension of the preceding reasoning. We seek good 
vertices, such as winning and prewinning vertices, that win or lead only to bad 
vertices. When our opponent is forced to move to a bad vertex, such as losing 
vertices, then we will always be able to move to another good vertex. A winning 
strategy for the first player will tell the player how to find good vertices to 
move to from any bad vertex. This pattern of play, with the first player moving 
to successive good vertices and the second player forced to move to bad vertices, 
will continue until finally the first player reaches a good vertex that is a winning 
vertex. If the game starts at a good vertex and the first player’s initial move 
must be to a bad vertex, then roles are reversed and it is the second player 
who now has a winning strategy using the good vertices. 


Example 1 (continued) 


The winning strategy for the first player in this game is to move to a vertex 
whose number is a multiple of 5. These are the good vertices. Thus, the first 
move from the starting vertex 16 is to vertex 15 (i.e., the first player removes 
one object). From 15, the second player must move to one of the vertices 11, 
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12, 13, or 14. From any of these bad vertices the first player now moves to 
vertex 10. Whatever the second player’s next move is, the first player will 
always be able to move to the prewinning vertex 5, and one round later the 
first player will win. Note that if the game started with only 15 objects, then 
the roles would be reversed and the second player would be able to use this 
winning strategy. m= 


We formalize the concept of good vertices with the following definition. 
A kernel in a directed graph is a set of vertices such that: 


1. There is no edge joining any two vertices in the kernel; and 
2. There is an edge from every nonkernel vertex to some kernel vertex. 


The kernel of the graph in Example 1 is the set of vertices numbered 0, 
5, 10, and 15. 


Theorem 1 


If the graph of a progressively finite game has a kernel K, then a winning 
strategy for the first player is to move to a kernel vertex on every turn. However, 
if the starting vertex is in the kernel, then the second player can use this winning 
strategy. 


Proof 


First we show that all winning vertices must be in K. The reason is that vertices 
not in a kernel must have an edge directed to a vertex in the kernel. But winning 
vertices have 0 out-degree. Thus, all winning vertices must be in any kernel. 

Next we show that moving to a kernel vertex on every turn is a winning 
strategy for the first player. By property (1) of a kernel, when the first player 
moves to a kernel vertex, the second player must then move to a nonkernel 
vertex. Then by property (2), the first player can always move from the nonkernel 
vertex to a kernel vertex. The play proceeds in this fashion with the first player 
always moving to a kernel vertex and the second player always moving to a 
nonkernel vertex. Since the game is progressively finite, the play must eventually 
end. Since all winning vertices are in the kernel, the first player must win. 

If the starting vertex is in the kernel, then the roles of the first and second 
players are reversed and the second player can always move to a kernel vertex 
for an eventual win. @ 


The preceding proof does not show explicitly how successively moving to 
kernel vertices leads to a winning vertex. The proof is existential. It shows 
only that by moving to kernel vertices the first player must eventually arrive 
at a winning vertex. 

A more immediate problem is to prove that the graph of any progressively 
finite game has a kernel and to show how to find this kernel. Not all graphs 
have kernels. For example, the graph in Figure 10.10 has no kernel. To show 
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a b 


Figure 10.10 d 


that graph has no kernel, we argue thus. If there were a kernel, we could assume 
by the symmetry in this graph that vertex a is in the kernel. Then vertex e, 
which has an edge to a, cannot be in the kernel. Vertex d’s only outward edge 
goes to nonkernel vertex e, and so d must be in the kernel. Similarly, c cannot 
be in the kernel and then b must be in the kernel. But now the edge (aj }) 
joins two kernel vertices—a contradiction. 

Fortunately, graphs of progressively finite games do have kernels. To dem- 
onstrate the existence of kernels in such graphs, we need to organize the vertices 
in a progressively finite graph into levels based on the ‘‘distance’’ of the vertices 
from a winning vertex. We recursively define the level /(x) of vertex x in a 
directed graph and the sets L, of vertices at level = k as follows. Let s(x) = 
{y | x has an edge to y} be the set of successors of x. Then 


I(x) = 0 © s(x) is empty and = Ly = {x | (x) = 0} 

Ix) =1eax € Ly and s(x) Cy and = LL, = Ly U {x| Ux) = 1} 
and in general, 

Iny=kKexEL,andsayCh and L=k-,U {x| lx) =k} 


Observe that L, — L,_, is the set of vertices at level k. 

It can be shown that /(x) is the length of the longest path in the directed 
graph that starts at x (see Exercise 12). Since all paths in a progressively finite 
graph have finite length, the longest path from a vertex x has finite length. 
Further, the longest path from x must end at a vertex of out-degree 0; otherwise 
the path could be extended. Then starting from vertices of out-degree 0, the 
assignment of level numbers by this recursive definition will eventually reach 
all vertices. 

It follows from this definition of level that every vertex at level 0 is a 
winning vertex and that every vertex at level 1 is a losing vertex. A vertex at 
level 2 is also a losing vertex if it is adjacent to a vertex at level 0; it is a 
prewinning vertex only if all its successors are at level 1. Also, every vertex 
at level k (k > 0) must be adjacent to a vertex at level k — 1 but cannot be 
adjacent to any other vertex at level k (or greater). Now we can prove the 
fundamental theorem for progressively finite games. 


Theorem 2 


Every progressively finite game has a unique winning strategy. That is, the 
graph of every progressively finite game has a unique kernel. 
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Proof 


The proof is by induction on the levels or, more precisely, on the sets L;. Let 
K, be the set of kernel vertices in L,. First consider the set Lp. Lo consists of 
vertices with 0 out-degree. These are the winning vertices. As noted in the 
proof of Theorem 1, all winning vertices must be in the kernel. Thus Ko = Lo. 

Next let us inductively assume for n = 1 that K,-, is the unique, well- 
defined set of kernel vertices in L,-;. We must show that we can find a unique 
set of level-n vertices that must be added to K,-; to form the kernel K,, for L,. 
By the way that level numbers were defined, /(x) = 1 means that s(x) € L,-1. 
If a level-n vertex x is adjacent to no kernel vertex of K,-;, this x must be in 
K,, (since by the definition of a kernel, any vertex with no successors in the 
kernel must itself be in the kernel). On the other hand, if x is adjacent to a 
kernel vertex of K,-;, x cannot be in the kernel. Hence K, = K,-, U {x I(x) 
= n and s(x) N K,-,; = @} is the unique, well-defined set of kernel vertices 
in L,. It follows by mathematical induction that the graph has a unique kernel. 
By Theorem 1, this kernel is the unique winning strategy. @ 


The proof of Theorem 2 tells us how to build a kernel. First put the winning 
vertices in the kernel. Then recursively add the vertices at successive levels 
not adjacent to the current set of kernel vertices. We implement this procedure 
for finding kernels using a labeling rule called a Grundy function. 


Definition of Grundy Function g(x) 


For each vertex x in a directed graph, g(x) is the smallest nonnegative 
integer not assigned to any of x’s successors. 


We shall prove shortly that vertices with Grundy number 0 are exactly the 
set of kernel vertices. In the next section, Grundy numbers will be seen to play 
a fundamental role in more complex games. 

In the graph of a progressively finite game, Grundy values can be easily 
determined using a level-by-level approach. The vertices on level 0 have no 
successors and so their Grundy number will be 0 (the smallest nonnegative 
integer). Next we determine g(x) for vertices x at level 1, then vertices at level 
2, and so forth. In this way, all of a vertex x’s successors are assigned Grundy 
numbers before it is time to determine the Grundy number of x, since x’s 
successors are at lower levels. When we come to x, we can check the Grundy 
function values of s(x), x’s successors, and set g(x) equal to the smallest 
nonnegative integer not assigned to any vertex in s(x). Actually we do not need 
to proceed in a totally level-by-level fashion. We can use any method that does 
not try to assign a vertex its Grundy number until all the vertex’s successors 
have Grundy numbers. 
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No Grundy function can be defined for the graph in Figure 10.10. Each 
vertex x in Figure 10.10 has a successor whose Grundy number must be defined 
before g(x) can be determined. Even if we try to invent simultaneously g(x) 
for all vertices in Figure 10.10 at once, no Grundy function exists (details are 
left to the reader). 

The Grundy numbers for the vertices in Figures 10.8 and 10.9 are given 
in the following tables. Note that the vertices in Figure 10.8 that are in the 
kernel—namely, 0, 5, 10, 15——all have Grundy number 0. This is no accident. 


Table of Grundy Numbers for Figure 10.8 


Vertex x 16 15 14 13 12 11 10 9 8 765 4 3 


2 1 0 
g(x) 1 0 4 3 2 1 043 21043 2 «21 ~0 
Table of Grundy Numbers for Figure 10.9 


r 
Vertexx 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 
eo). 10 S20 110 2 1 20. 3 1-0 8 1 2 0 


17 18 19 20 21 22 23 24 25 26 27 28 29 30 
1 0 2 1 0 3 2 1 0 2 0 1 2 +0 


31 32 33 34 35 36 37 38 39 40 over 40 
1 2 0 1 2 0 1 3 2 ~=«1 0 


Theorem 3 


The graph of a progressively finite game has a unique Grundy function. Further, 
the vertices with Grundy number 0 are the vertices in the kernel. 


Proof 


The recursive level-by-level construction of a Grundy function for the graph 
of a progressively finite game gives each vertex a unique Grundy number, as 
described above. By the definition of a Grundy function, a vertex x with Grundy 
number 0 cannot have a successor with Grundy number 0 (or else x would 
have to have a different number). Similarly any vertex y with Grundy number 
g(y) = k > 0 must have an edge to some vertex with Grundy number 0 (or 
else y’s number would be 0). Thus, the set of vertices with Grundy number 0 
satisfies the two defining properties of a kernel. @ 


Example 2 (continued) 


By Theorem 3 and the foregoing table of Grundy numbers for Figure 10.9, we 
see that the kernel is the set 0, 4, 6, 9, 12, 16, 18, 21, 25, 27, 30, 33, 36, ‘‘over 
40.”’ Since the starting vertex is in the kernel, the second player has the winning 
strategy in this game. A play of the game might proceed as follows (let player 
A be the first player and B be the second player): first A moves to 1, then B 
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moves to kernel vertex 6, then A must move to 11 (a move to 7 or 8 lets B 
win at 9), then B moves to kernel vertex 12, then A must move to 17, then B 
moves to kernel vertex 18, then A moves to 20, then B to 21, then A to 22, 
then B to 27, then A to 32, then B to 33, then A to 35 (A senses defeat and 
gives up), and then B moves to the winning vertex 36 (and collects the 36 
cents). m 


10.2 EXERCISES 


Summary of Exercises The first 10 exercises involve finding 
kernels and Grundy functions in various graphs of progressively finite games. 
Exercises 11-17 involve proofs of properties of kernels, Grundy functions, and 
level numbers. Exercises 18 and 19 present two more complicated progressively 
finite games. 


1. Find a kernel in the following graphs or show why none can exist: 


(a) (b) Coe ae 
| Fd 

Ss ; 
an nm 

d 8 h 


2. Suppose that there are 25 sticks and two players take turns removing up 
to 5 sticks. The winner is the player who removes the last stick. Which 
player has a winning strategy? Describe this strategy. 


3. Repeat Example 2 with 2, 3, or 7 cents added each time. Find the set of 
positions in the kernel. 


4, Show that in Example 2 the second player can always win by the second 
move. 


5. Suppose in Example 2 that the first player A knows he/she will lose and 
wants to minimize his/her loss. What is the smallest winning amount A 
can force B (the second player) to accept (such that if B tried to make the 
game go longer, then A could get into the kernel and win)? 


6. (a) Suppose we have a pile of 7 red sticks and 10 blue sticks. A player 
can remove any number of red sticks or any number of blue sticks or 
an equal number of red and blue sticks. The winner is the player who 
removes the last stick. Find the set of positions in the kernel. 

(b) Repeat part (a) with a limit of removing at most 5 sticks of one color 
(or both colors) on a move. 


7. Repeat Example 1 but now the player to remove the last object loses. 
Describe the winning strategy for this game. 
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. Find the Grundy function for graphs or games in: 


(a) Exercise 1(a) (b) Exercise 1(b) (c) Exercise 3 


. Show that there is no Grundy function for the graph in Figure 10.10. 
. Find a directed graph possessing a kernel but no Grundy function. 
. If W(S) is the set of vertices without an edge directed to any vertex in the 


set S, show that a set S is a kernel if and only if S = W(S). 


. (a) Show that if /(~) = k for a vertex x in the progressively finite graph 


G, then k is the length of the longest path starting at x in G. 
(b) Show that if g(x) = k, there is a path of length & starting at x in G. 


. Show that for any vertex x in a progressively finite graph, g(x) = /(x). 
. Show that if a directed graph G and every subgraph of G (obtained by 


deleting various vertices) have kernels, then G has a Grundy function. 


. Show that both the level numbers and the Grundy function in a progressively 


finite graph G constitute proper colorings of G. 


. Show that no matter how the edges are directed in a bipartite graph, it will 


always have a Grundy function. 


Show that the graph of a progressively finite game can have only a finite 
number of vertices. [Hint: Show that if there are an infinite number of 
vertices, then there must be an infinite path (infinite play).] 


. Consider the following graph game. Player A tries to make a path with a 


set of vertices from the left to the right side of this graph. Player B tries 
to make a path from top to bottom. The players take turns picking vertices 
until one player gets the desired path. 


ANA 


iN 
VANVAN 
VAINVAIS 


WA 
VA 
VANVAIS 


(a) Show that this is a progressively finite game. 


(b) Find a winning strategy for Player A. 


. The game of kayles has a row of n equally spaced stones. Two players 


alternate turns of removing one or two consecutive stones (with no interven- 
ing spaces). The player to remove the last stone wins. 


(a) Draw the graph and find its Grundy function for a four-stone game. 
(b) Repeat part (a) for a six-stone game. 
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10.3 NIM-TYPE GAMES 


In this section we extend the theory of progressively finite games to takeaway 
games involving several piles of objects. The simplest game of this form is 
called Nim, a game in which two players take turns removing any amount they 
wish from one of the piles. The winner is the player who removes the last 
object from the last remaining (nonempty) pile. While the positions in the two 
games in Examples | and 2 in the previous section could be described with a 
single nonnegative integer representing the size or monetary value of the single 
pile, the position in a Nim game requires a vector of nonnegative integers 
(Pi, Pr» »++, Pm), the Ath number p, representing the current size of the kth 
pile. 


Example 1 


Consider the Nim game with four piles of sticks: one stick in the first pile, 
two sticks in the second pile, three in the third, and four in the fourth. See 
Figure 10.11. We represent the initial position of this game with the vector 
(, 2, 3, 4). 

Let the first and second players be named A and B, respectively. A sample 
play of the game might go as follows. First A removes all four sticks from the 
fourth pile. The new position is (1, 2, 3, 0). Next B removes one stick from 
the third pile to produce position (1, 2, 2, 0). Now A removes the one stick in 
the first pile to produce position (0, 2, 2, 0). A has been playing a winning 
strategy, that is, moving into kernel positions, and is now about to win. If B 
removes all of the second or third pile, A will remove the other pile; or if B 
removes just one stick, A will remove one stick from the other pile and A will 
win on the next round. The reader is encouraged to play this Nim game with 
a friend. = 


The game in Example | has 2 x 3 x 4 X 5 = 120 different positions (the 
ith pile has i + 1 possible sizes, 0, 1,..., 7). Thus, it would be very cumbersome 
to draw the graph of this game and compute its kernel with a Grundy function, 
as in the previous section. In general, a Nim game with m piles and n; objects 
in the ith pile will have (n; + 1)(m. + 1) +--+ (m,, + 1) different positions. The 


Figure 10.11 | 11! 
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Figure 10.12 


only way that we could ever play winning Nim without a computer is if we 
could determine the Grundy number of a position (vertex) directly from the 
position vector (p;, P2,. . . ,Pm)- Fortunately, such direct computation is possible. 

First we need to examine the Grundy function for a single-pile Nim game. 
Consider the Nim game with one pile of four sticks. See Figure 10.12. The 
graph of this game has edges (i; j), for all 0 = j = i = 4. The Grundy number 
of vertex 0, the winning vertex, is 0; the Grundy number of vertex 1 is 1; and 
so on. In any one-pile Nim game, vertex i will have a Grundy number of i, 
since vertex i has edges to all lower-numbered vertices. Of course, the strategy 
of any one-pile Nim game is trivial: the first player removes the whole pile 
and wins. The nice form of this single-pile Grundy function will simplify the 
computation of Grundy functions for multi-pile Nim games. 

Although the graph of a multi-pile Nim game is too complex to draw, we 
can still describe it symbolically. We have a vertex for each position (p,, Po, 

. 5 Pm)» Where 0 <= p; = n; (n; is the initial size of the ith pile). Since the 
only permissible moves are removing some amount from one pile, the associated 
graph has edges from vertex (pj, P2, ---, Pm) to vertex (1, G2, +++» dm) for 
each pair of vertices such that for one j, g; < p;, and for all i # j, q; = p;. This 
graph is in some sense a ‘‘composition’’ of the graphs for each pile, for if we 
fix all p; except one, say ps, then the subgraph of vertices (p, po, 9, Pa .--, 
Pm)s (Pis Pry 1s Pay «+ +> Dds ++ +> (Pts Pry M3 Pas +++ 5 Pm ) 18 exactly the graph 
for pile three alone. 

This type of composition of graphs can be formalized as follows. The 
direct sum H = H, + H, +--+ + H,, of graphs H,, Az, ....Hm with vertex 
sets X,, X2,..., Xm, respectively, has vertex set X = {(%1, X2, 6 ++ Xm) | x; E 
X,, 1= i <= m} and edges defined by the successor sets 


(0,0) (1,0) (2,0) 
0 1 2 


Figure 10.13 (a) (b) 
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S((, XQ, - + +s Xm)) = {(y, X72, X3, -- + Xm)ly € 5(x,)} 
U {x ys X34, 2 + Xm ly € X(%)} 
U hee: Xap es Xm-t, YY E 5%} 


It follows from this definition that the graph G of an m-pile Nim game is the 
direct sum of the graphs G; of the ith pile: G = G, + G. +--+ + G,. 


Example 2 


Consider the simple Nim game with two piles of two objects each. Figure 
10.13a shows the graph G, = G, of one pile alone. Figure 10.136 shows the 
graph G = G, + G, of the two-pile game. Find the Grundy function and kernel 
of this graph. 

The Grundy numbers for the vertices are shown in Figure 10.136. The 
vertices with Grundy number 0 are in the kernel. Since 2((2,2)) = 0, the second 
player has the winning strategy, according to Theorems 1 and 3 in the previous 
section. 


Next we show how to compute the Grundy number of a vertex (x), %,..., 
Xm) in a direct sum of graphs from the Grundy numbers g(x;) of the vertices 
x; in the component graphs G;. The computation to be performed on these 
Grundy numbers is called a digital sum. 

The digital sum c of nonnegative integers c,, C2, ..., Cm, Written 


c=c,¢taq+o4+°-++c, 


is computed in the following manner. Let c® be the Ath binary digit in a binary 
expansion of c, that is, c = c® + c%2 + c®2? + +++, and similarly let c® 
be the Ath digit in a binary expansion of c;. Then c® = ci + cH? +--+ 4+ 
ci’ (modulo 2). That is, the Ath binary digit of c is 1 if the sum of the th 
binary digits of the c;s is odd, and the Ath binary digit of c is 0 if the sum of 
the kth binary digits of the c;’s is even. 


Example 3 


Compute the digital sum 2 + 12 + 15 + 8. We write the numbers 2, 12, 15, 
and 8 in binary form and determine the sum (modulo 2) of the digits in each 
column, as illustrated in Figure 10.14. Translating the value of the binary sum 
back into an integer, we see that this digital sum equals 9. a 


We now present a remarkable theorem. In essence, it says that digital sums 
are the only way to win at Nim. 
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2). 3 9e 2h 2 


2? 2} 7 
Figure 10.14 1=0 0 1 
Theorem 
If the graphs G,, G2, ..., G, possess Grundy functions g(x), then the direct 


sum G = G, + G, + - ++ + G,, possesses a Grundy function g(x) defined by 
BCX) = B((X1, Xa Xm) = Bi) + Bal) + +++ + Bnm) 


Proof 


We must show that g(x) is the smallest nonnegative integer that is not equal 
to any number in the set {g(y) | y € s(x)}. This definition of a Grundy function 
can be broken into two parts: (1) If y © s(x), then g(x) # g(y): and (2) for 
each nonnegative integer b < g(x), there exists some y € s(x) with g(y) = b. 
Part (1): Show that if y © s(x), then g(x) # g(y). Since y = (1, yo, 
~> Ym) © S((X1, X2, ..-, Xm)), then by the definition of direct sum, for some 
Js yj © s(x), and for all i # j, y; = x;. If c, = g(x, and d, = g,(y,), then 
c; # d; and c; = d;. So 
a(xy=cp tat: -tet-:--t+e,=c' te, 
ey =dtat+-:-tdjt+-+++d,=c' +d, 


where c’ is the digital sum of all cs except c;. It is not hard to show that c’ 
+c; = c’' + d, if and only if c; = d; (see Exercise 7). Since c; # dj, we 
conclude that g(x) # g(y), as required. 

Part (2): Show that for each nonnegative integer b < g(x), there exists 
some y € s(x) with g(y) = b. A general proof of this part is fairly technical 
(see Berge [2], p. 25, for details). The practical side of this proof, discussed 
below, is finding a y with g(y) = 0 (a kernel vertex) and moving to it. A formal 
proof of part (2) is a generalization of the discussion below. 
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Corollary 


The vertex (p,, 2, ..., Pm) in the graph of an m-pile Nim game has the Grundy 
number p; + p. + °° + Pm. Thus (pi, P2,.-++ + Pm) is a kernel vertex if and 
only if p) + pp +--+ +p, = 0. 


The reader should go back to the Nim game of two piles of two objects 
each in Example 2 and check that the Grundy numbers obtained for the vertices 
of the graph of that game are the digital sums of the pile sizes. 


Example 1 (continued) 


The Grundy number of the starting vertex of the Nim game in Figure 10.11 is 
the digital sum 1 + 2 + 3 + 4 = 4, as computed in Figure 10.15. The first 
player A wants to decrease the size of one of the piles so that the new digital 
sum is 0 (a kernel position). That is, A should alter the binary digits in one 
row of Figure 10.15 so that the sum (bottom) row is 000. For the sum row 
to become all Os, every column now having an odd number of 1s should have 
one of its digits changed (either a 1 to a 0 or a0 to a 1) to make the number 
of 1s in that column even. 

Since the sum row has a | in just the 2* column, we can change the (single) 
I in that column to a 0. That is, pile 4’s binary expansion should be changed 
from 1 0 0 to 000, and so player A should remove all sticks in the fourth pile. 
This new position (1, 2, 3, 0) has a Grundy number of 0. Note that to make 
the number of 1s even in the 2? column, we could not change any 0 to a 1, for 
the new binary expansion in the altered row would then be a larger number— 
an impossible move in Nim. m 


We now generalize the method of finding a vertex with Grundy number 0 
given in the preceding example. Form a digital sum table as in Figure 10.15 
for the current game position. Pick a row e having a 1 in the leftmost column 
that has an odd number of 1s, that is, having a 1 in a column with a 1 in the 
sum row. In row e, change the digit in every column having a 1 in the sum 
row. After this change of digits, every column will have an even number of 
Is, and so the sum row will be all Os. Thus, this new position will be a kernel 
vertex. Note that since the leftmost digit that is changed in row e is a 1 (this 
is how row e was chosen), changing digits in row e will yield a smaller number, 


2=0 1 0 
3=0 1 1 
4=1 0 0 


Figure 10.15 1 0 0=4 
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22,2. 320 
tI 2=0 1 O 
It] 3=0 1 1 
aoe 4=1 0 0 
lHi1tl 6=1 1 0 
0 1 1=3 
Figure 10.16 (a) (b) 


call it h. The first player should thus decrease the size of the eth pile to a size 
of h objects. 


Example 4 


Consider the Nim game of four piles with 2, 3, 4, and 6 sticks shown in Figure 
10.16a. The digital sum table for the initial position is shown in Figure 10.16b. 
The 2' column is the leftmost column with a 1 in the sum row, and so we must 
change a row with a 1 in the 2' column. We can use the first, second, or fourth 
row. Suppose that we choose the first row. Then since the 2' and 2° columns 
in the sum row have 1s, we change the digits in these columns in the first row. 
The new first row is 00 1. Thus, the first player should reduce the size of the 
first pile to 1. The reader should continue the play of this Nim game with a 
friend (or against oneself) to practice this kernel-finding rule. = 


10.3 EXERCISES 


Mo 1. Find the Grundy number of the initial position and make the first move in 


a winning strategy for the following Nim games: 


1 | | I | | | 
(a) I 11 (b) | 11 (c) 1111 (d) 111 

111 PI td Pld IY tl 

I1ttl Prtd ttl PIII Itt 


2. Suppose that no more than two sticks can be removed at a time from any 
pile. Repeat the games in Exercise 1 with this additional condition. 

3. Suppose that no more than i sticks can be removed at a time from the ith 
pile (piles are numbered from top to bottom). Repeat Exercise 1 with this 
additional condition. 
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10. 
11. 


. For the Nim game in Exercise l(c), find moves that yield positions with 


Grundy number equal to: 
(a) 1 (b) 2 (c) 3. 


. Suppose three copies of the game in Example 2 of Section 10.2 are played 


simultaneously. Players stop adding money to a pile when the value of the 

pile is a square or exceeds 40. The player who adds the last amount to the 

last pile wins the money in all three piles. Note that there is a table of 

Grundy numbers for this game near the end of the previous section. 

(a) If initially there are 2¢ in two piles and 1¢ in the third pile, what is 
the Grundy number of this position and what is a correct winning 
move? 

(b) Find all kernel positions in which the sum of the money in the first 
two piles is less than 10¢ and the third pile is empty. 

(c) Using the table of Grundy numbers for the one-pile game near the end 


of Section 10.2, write a computer program to compute the next winning 
move in this three-pile game. 


. Draw the direct sums involving the graphs shown below. 
(a) G, + G, (b) G, + G, + G; 
A4, x2 J1 ez 
Gy x3 G2 G3 
*4 ¥2 722 
. Prove the assertion in the proof of the theorem that c’ + c; = c’ + d,if 


and only if c; = dj. 


. Generalize the argument for finding kernel vertices in Nim (preceding 


Example 4) to obtain part (2) of the proof of the theorem. 


. Consider the variation of Nim in which the last player to move loses, Show 


that the winning strategy is to play the regular last-player-wins Nim strategy 
but when only one pile has more than one stick now decrease that pile’s 
size to 1 instead of 0 (or to 0 instead of 1). 

Write a computer program to play winning Nim. 

Consider the following variation on Nim. Place C(n + 1, 2) identical balls 
in a triangle, like bowling pins. Two players take turns removing any 
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number of balls in a set that all lie on a straight line. The player to remove 
the last ball wins. 


(a) Find a winning first move for this game when n = 3. 
(b) Repeat for n = 4. 


sy 


~10.4 SUMMARY AND REFERENCES 


The graph model for Instant Insanity is due to de Carteblanche [4] (reputedly 
a pseudonym for the famous graph theorist W. Tutte). The first published 
analysis of Nim appeared in 1902 by C. Bouton [3]. The Grundy function for 
progressively finite games was presented by P. Grundy [7] in 1939. It was 
independently discovered a few years earlier by the German mathematician 
Sprague. 

This chapter has only scratched the surface of the theory of finitely progres- 
sive games. Interested readers should turn to On Numbers and Games, by J. 
Conway [5], the world’s acknowledged master gamesman, and to Winning 
Ways by Berlekamp, Conway, and Guy [1]. The reader should also see the 
classic book about mathematics and games, Mathematical Recreations and 
Essays by Coxeter and Ball [6]. 


1. E. Berlekamp, J. Conway, and R. Guy, On Winning Ways, 2 volumes, 
Academic Press, New York, 1982. 
2. C. Berge, The Theory of Graphs, Methuen-Wiley, New York, 1962. 


3. C.L. Bouton, “‘Nim, a game with a complete mathematical theory,’’ Ann. 
Math. 3 (1902), 35-39. 


4. F. de Carteblanche, ‘‘Pile of cubes,’’ Eureka, April, 1947. 
5. J. H. Conway, On Numbers and Games, Academic Press, New York, 1976. 


6. H. Coxeter and W. Ball, Mathematical Recreations and Essays, University 
of Toronto Press, 1972. 


7. P.M. Grundy, ‘‘Mathematics and games,’’ Eureka, January, 1939. 


APPENDIX 


A.1 SET THEORY AND LOGIC 


A set is a collection of distinct objects. In contrast to a sequence of objects, a 
set is unordered. Usually we refer to the objects in a set as the elements, or 
members, of the set. These elements may themselves be sets, as in the set of all 
5-card hands; each hand is a set of 5 cards. A family is a collection in which 
multiple appearances of objects is allowed; for example, {a, a, a, b, b, c}. 

A set S is a subset of set T if every element of S is also an element 
of T. In the case of a 5-card hand, we are really working with a 5-card subset 
of the set of all 52 cards. Any set is trivially a subset of itself. A proper subset 
is a nonempty subset with fewer elements than the whole set. Two sets with 
no common members are called disjoint sets. 

This text uses capital letters to denote sets and lowercase letters to denote 
elements (unless the elements are themselves sets). The number of elements 
in a set S is denoted by N(S). The symbol € represents set membership, for 
example, x © S means that x is an element of S; and x € S means that x is 
not a member of S. The symbol C represents subset containment, for example, 
T C S means that T is a subset of S. 

There are three ways to define a set with formal mathematical notation: 


1, By listing its elements, as in S = (x), X2, X3, x3} or T= {{1, —1}, {2, —2}, 
{3, —3}, (4, —4}, {5, —5}, ...}-—implicitly, T is the set of all pairs of a 
positive integer and its negative. 

2. By a defining property, as in P = {p | p is a person taking this course} 
or R = { r| there exist integers s and ¢ with t # 0 and r = s/t}—R is the 
set of all rational numbers; 


3. As the result of some operation(s) on other sets (see below). 


There are two special sets we often use: the empty, or null, set, written 
@; and the universal set of all objects currently under consideration, written 
OU. 

It is important to bear in mind that in many real-world problems, sets 
cannot be precisely defined or enumerated. For example, the set of all ways in 
which a large computer program can fail is ill defined. A census of the population 
of the United States involves substantial error for several important subcategor- 
ies of the populace, yet the official United States population was given in 1990 
as 248,709,873. Beware of any calculations based on imprecise sets! 
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The three basic operations on sets that we will use are 


1. The intersection of S and T,S QT = {x € U| x ES andx € T}. 
2. The union of S and T,S UT = {x © U|x E€Sorx ET}. 
3. The complement of S,S = {x € U|x € S}. 


A fourth operation that is sometimes useful is 
4. The difference of S of minus T,S — T = {x E U|x ES andx € T}. 


Observe that S — T can be expressed in terms of the preceding operations as 
S-T=SQT. 


Example 1 


A sample of eight brands of calculators consists of four machines that have 
rechargeable batteries and four that do not. Also four out of the eight have 
memory. We want to select four calculators to be taken apart for thorough 
analysis. Half of this set of four should be rechargeable and half should have 
memory. How many ways can such a set of four machines be chosen from the 
eight brands? 

To answer this question, we must know how many machines are rechargea- 
ble and have memory, how many are rechargeable and have no memory, and 
so on. That is, if R is the set of rechargeable machines and M the set of machines 
with memory, then we need information about the sizes of the intersections 
N(R MM), NR OM), N(R MM), and NR MM). Assume that there are 
two machines in each of these four subsets. See Figure Al.1. Then one Strategy 
to get the desired mix of four machines would be to pick one machine from 
each category in Figure Al.1 (2 X 2 X 2 X 2 = 16 choices). There are two 
other strategies for getting the four machines from Figure A1.1 (see Exercise 
4 at the end of this section). = 


The study of set expressions involving the foregoing set operations and 
associated laws is called Boolean algebra. Three of the most important laws 
of Boolean algebra are 


No 
Memory Memory 


Rechargeable 


Not 
Rechargeable 


Figure Al.1 
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To visualize set expressions, we use a picture called a Venn diagram. 
Figure A1l.2a shows a Venn diagram for the sets S and T. The whole rectangle 
represents the universe UW of all elements under consideration. The circles 
represent the sets S and T. In Figure A1.2b, the darkened region represents 
S 1 T and the striped area (S U T) — (S NT). Note that in particular problems, 
certain regions in a Venn diagram may be empty sets. 

The next example illustrates how laws BA2 and BA3 can simplify a count- 
ing problem. 


Example 2 


Consider the universe U of all 52? outcomes obtained by picking one card from 
a deck of 52 playing cards, replacing that card, and then again picking a card 
(possibly the same card). Suppose that we want to compute the size of the set 
Q of outcomes with at least one spade or at least one heart. 

Let S be the set of outcomes with a void in spades, that is, in which no 
spade is chosen on either pick. Let H be the set of outcomes with a void in 
hearts. Write the set Q as an expression in terms of S and H, and calculate 
N(Q). 

The set Q equals § U H, the union of the set of outcomes with one or 
more spades (not a void in spades) and the set of outcomes with one or more 
hearts. The set S U H is the shaded area in Figure Al.3. By BA3, S UH = 
SH, where S  H is the set of outcomes with no spades and no hearts (§ M 
H is the unshaded region in Figure A1.3). That is, S N H is the set of outcomes 
where each pick is one of the 26 diamonds or clubs. Thus N(S N H) = 26°. 
By Figure Al.2, N(S OH = NU) — N(S N A) = 52? — 26? = 2704 — 676 
= 2028, and so N(S U H) = N(SN HA) = 2028. = 


At the elementary level, mathematical logic is very similar to set theory. 
Instead of sets, logic deals with statements or propositions. Statements are 
usually represented by lowercase letters. They have the value true (T) or false 
(F). The three principal operations on statements are as follows: 


1. The conjunction of p and q, p /\ q. Statement p /\ q is true if both p and 
q are true. 


Figure Al.2 (a) 
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Figure Al.3% 


2. The disjunction of p and q, p \/ q. Statement p \/ q is true if p or q (at 
least one) is true. 


3. The negation of p, ~p. Statement ~p is true if p is false. 


If p is ‘‘this integer is even’’ and q is ‘‘this integer is less than 25,’’ then p /\ 
~gq is ‘‘this integer is even and is not less than 25.’’ A truth table is used to 
enumerate the value (7 or F’) of the foregoing operations for each possible pair 
of values of p and q. 


Pq pq p\q ~p 
EF ‘f r F 
a a t F 
F T  F T T 
F FF F f 


The study of expressions built from such logical operations is called propo- 
sitional calculus. The three axioms of propositional calculus corresponding to 
laws BAI, BA2, BA3 above are 


PCl. ~~p=p 
PC2. ~(p/\q) = ~p\/ ~@ 
PC3. ~(p\/ gq) = ~p/\~@ 


With each statement p, one can associate a set S, of all elements (in a given 
universe) that satisfy statement p. Conversely, to each set S, one can associate 
a statement ps, ‘‘this object is an element in S.”’ 

Note that if the values of T and F are replaced by 1 and 0, respectively, 
then computer logic design, which uses ‘‘AND,’’ ‘‘OR,’’ and “‘NOT,’’ gates, 
is seen to be a problem in propositional calculus. 

Set theory, and more generally, nonnumerical mathematics, were studied 
little until the nineteenth century. G. Peacock’s Treatise on Algebra, published 
in 1830, first suggested that the symbols for objects in algebra could represent 
nonnumeric entities. A. De Morgan discussed a similar generalization for alge- 
braic operations a few years later. G. Boole’s Investigation of the Laws of 
Thought (1854) extended and formalized Peacock’s and De Morgan’s work to 
present a formal algebra of sets and logic. The great philosopher mathematician 
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Bertrand Russell has written ‘‘Pure Mathematics was discovered by Boole.”’ 
Subsequently it was found that numerical mathematics also needs to be defined 
in terms of set theory in order to have a proper mathematical foundation (see 
Halmos’s Naive Set Theory). See Chapter 26 of Boyer’s History of Mathematics 
for more details about the history of set theory. 


EXERCISES 


Mh 1. Let A be the set of all positive integers less than 30. Let B be the set of 


all positive integers that end in 7 or 2. Let C be the set of all multiples of 
3. List the numbers in the following sets: 


(a AN(BNC) (b) AN(BUC) () AN(BUC) (d) A-(BNC) 


2. If A = {1, 2, 4, 7, 8}, B = {1, 5, 7,9}, and C = {3, 7, 8,9} and U = 
{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, then find set expressions (if possible) equal 


to: 
(a) {7} (c) {1} (e) {4, 8, 10} 
(b) {6, 10} (d) {2, 7, 9} (f) {3, 5, 6, 7, 9, 10} 


3. Suppose that in Example 1 we were given only the additional information 
that two calculators have memory but are not rechargeable. Show that we 
can now deduce how many machines are in each of the other three boxes 
in Figure Al.1. 


4. What are the other two strategies in Example 1 to pick a subset of four 
calculators, two of which are rechargeable and two of which have memory, 
given the numbers in Figure Al.1? How many choices are there for each 
of these ways? 


5. Suppose we are given a group of 20 people, 13 of whom are women. 
Suppose 8 of the women are married. In each of the following cases, tell 
whether the additional piece of information is sufficient to determine the 
number of married men. If it is, give this number. 


(a) There are 12 people who are either married or male (or both). 
(b) There are 8 people who are unmarried. 
(c) There are 15 people who are female or unmarried. 
6. Label each region of the Venn diagram in Figure A1l.2a with the set 


expression it represents (e.g., the region where both sets intersect would 
be labeled A M B). 


7. Draw Venn diagrams and shade the area representing the following sets: 
(a) A-~B- (b)AUB_ (ce) AUB)N(NB) (dd) A-(B-A) 


8. Write as simple a set expression as you can for the shaded areas in the 
following Venn diagrams. 
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16. 


17, 


Ss 
\ < 


\ 


\ 


sx 


XV 


. Create a Venn diagram for representing sets A, B, and C. The diagram 


should have regions for all possible intersections of A, B, and C. How 
many different regions are there? Label each region with the set expression 
it represents (e.g., the region where all three sets intersect would be labeled 
ANBNC). 


. Draw Venn diagrams for sets A, B, and C and shade the areas representing 


the sets in Exercise 1. 


. Verify with Venn diagrams the following laws of Boolean algebra: 


(a) (AUB)NA=A (dd) AUB) NCHANC)U(BNC) 
(b) (AN B)UA=A () ANB)UC=AUOQN (BUC) 
() AUA=% 


. Use Venn diagrams to determine which pairs of the following set expres- 


sions are equivalent. 
(a) (AN B)U(ANB) (c) (A — B)N (AUB) 
(b) A — (B-— (A- B)) (d) (AN (B - A)) 


. Use Law BA2 to prove that AN BNC =A UB UC [note that AN BN 


C=(ANB)NC=AN(BNC). 


. Use Law BA3 to prove thatA UBUC =ANMBMNC [note that A U B 


UC=(AUB)UC=AU(BUO). 


- Use Laws BAI, BA2, and BA3 plus the identities AM A= A,AUA= 


A,AMB=BQA, and A U B = B UA to simplify the following set 
expressions: 


(a) (AUB) N (ANB) (c) AUQN(AUB)N (BUC) 
(b) A — (B U A) (d) (AU B)N[(C UB) - ANB) 


Suppose two dice are rolled. How many outcomes have a 1 or a 2 showing 
on at least one of the dice? 


Suppose that a card is drawn from a deck of 52 cards (and replaced before 
the next draw) three times. Let S be the set of outcomes where the first 
card is a spade, H be the set where the second card is a heart, and C be 
the set where the third card is a club. For each of the following sets E, 
write E as a set expression in terms of S, H, C, and determine N(E). 


(a) Let E be the set of outcomes contained in at least one of the sets S, 
H, C. 
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(b) Let E be the set of outcomes contained in at least one but not all three 
of the sets S, H, C. 


(c) Let E be the set of outcomes contained in exactly two of the sets S, 
H,C. 


18. For what values (T or F’) of propositions p and q are each of the following 
statements true? 


(a) ~(p V ~@) (ce) (pV ~g) \(~p Nq) 
(b) [I~pV QI VI[I-~AQq)l (d) ~(¢/\ q) 


19. For what values of propositions p, g, and r are each of the following 
statements true? 


(a) (p/\q) VV ~r © [~PYVQMIAl-@vV NI 
(b) (~p \ ~q) V ~r d@) (pPAQVPVNIA Hav ~r) 


20. Build a statement using propositions p, g, and r that is true only when 
exactly two of the three propositions are true. 


21. Two statements involving propositions p,, Px, . . . , py are equivalent if they 
are true for exactly the same sets of values of the propositions. How many 
different (nonequivalent) statements involving k propositions are there? 


22. Let a,a, be the binary digits in the binary representation of a number a, 0 
=a=3 (eg., if a = 2, then a, = 1 and a, = 0). Let bb, be similar 
binary digits for b, 0 = b = 3. Now let c = a + b, where c’s binary digits 
are C,C)C3. Treating 1 as true and 0 as false, express c,, c, and c; as 


propositional statements in terms of a,, a), b,, and by. 


A.2) MATHEMATICAL INDUCTION 


The most useful, and simplest, proof technique in combinatorial mathematics 
and computer science is mathematical induction. Let p, denote a statement 
involving n objects. Then an induction proof that p, is true for all n = 0 requires 
two steps: 


1. Initial step: Verify that py is true. 


2. Induction step: Show that if po, pi, Po, ..., Pr; are true, then p, must be 
true. 


Sometimes p, will be true only for k = n. Then the initial step is to verify 
that p, is true, and in the induction step we assume only that p,, Preis + +> Dna 
are true. 


Example 1 


Let s, denote the sum of the integers 1 through n, that is, s, = 1+2+3+ 
-++ +n. Show that s, = $n(n + 1). 
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We use mathematical induction to verify this formula. Since so is not 
defined, the initial step is to verify the formula for s,. The formula says s, = 
31(1 + 1) = 1—obviously correct. For the induction step, we assume that the 
formula is true for 5, 5, ..., S,-1. In this problem (as in most induction 
problems), we need only to assume that the formula is true for s,-}. 


Sr = 1L+24+3 40+ +-1)=Hn - YV[(m- 1) 4+ 1) =3- In 
We now use this expression for s,-, to prove that the formula is true for s,. 
S=(L+2+-+-+(m-)Djtn=s,-,+n 
=Hn—-1)nt+n 
= al(n — 1)n + 2n] = gn(n + 1) 


This completes the induction proof that s, = n(n + 1) for all positive n. = 


Example 2 


Suppose that the population of a colony of ants doubles in each successive 
year. A colony is established with an initial population of 10 ants. How many 
ants will this colony have after n years? 

Let a, denote the number of ants in the colony after n years. We are given 
that a) = 10. Since the colony’s population doubles annually, then a, = 20, 
a, = 40, and a; = 80. Looking at the first few values of a,, we are led to 
conjecture that a, = 2” X 10. For the initial step, we check that a = 10 = 2° 
x 10, as required. For the induction step, we assume that a,_, = 2”! X 10. 
Now we use the annual doubling property of the colony: 


An = 2G,-, = 2(2"' X 10) = 2" X 10 


Example 3 


A prime number is an integer p > 1 that is divisible by no other positive integer 
besides 1 and p. Show that any integer n > 1 can be written as a product of 
prime numbers. 

We prove this fact by mathematical induction. Since the assertion concerns 
integers n > 1, the initial step is to verify that 2 can be written as a product 
of prime numbers. But 2 is itself a prime. Thus 2 is trivially the product of 
primes (i.e., actually of a single prime), itself. Next assume that the numbers 
2, 3,...,— 1 can be written as a product of primes, and use this assumption 
to prove that can also be written as a product of primes. If 7 is itself a prime, 
then as in the case of 2, there is nothing more to do. Suppose 7 is not a prime, 
and so there is an integer m that divides nm and for some integer k, n = km. 
Since k and m must be less than n, they can each be written as a product of 
primes. Multiplying these two prime products for k and m together yields the 
desired representation of n as a product of primes. = 
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In his classic work Fundamental Algorithms, The Art of Computer Program- 
ming, Vol. I (Addison-Wesley, Reading, Mass., 1975), D. Knuth refers to 
mathematical induction as the ‘‘algorithmic proof procedure.’’ The simplest 
way to verify the formulas in Examples | or 2 (or the assertion in Example 3) 
with a computer for a particular value of n, say, n = 50, would be to write a 
program that recursively computes values for successive n up to n = 50. For 
Example 1, the program would be 


s<—0: 
FoORN = 1T050,00s<sin; 


A program to determine the factorization into primes of each successive integer 
up to 50 would similarly mimic mathematical induction. Thus, induction repre- 
sents a generalized case-by-case verification of a formula or assertion. 

The relationship between mathematical induction and algorithms is a two- 
way street. To verify that an algorithm or program works correctly, one usually 
uses some type of induction argument, based on the value of an input variable 
or on the number of iterations of a loop. 

Although the Greeks used certain iterative arguments in geometric calcula- 
tions and the principle of reductio ad absurdum, mathematical induction was 
first used explicitly by Maurolycus around 1550. Pascal used an induction 
argument in 1654 to verify the additive property of binomial coefficients in 
the array now known as Pascal’s triangle [this property is identity (3) in Section 
5.5]. The actual term induction was coined by De Morgan 200 years later. See 
Bussey, ‘‘Origin of mathematical induction,’’ American Math. Monthly, 1917, 
for a fuller discussion of the history of mathematical induction. 


EXERCISES 


Ms 1. Prove by induction that 1 + 3 +--+ + (Qn+1)=(@+4 1). 


n(n + 1)(2n + 1) 


2. Prove by induction that 17+ 2?+---+7r = 6 
3. Prove by induction that —1?7 + 2? — 3? + +++ + (-1)'W = 
n(n + 1) 
-—1 Ts SN es 
(-1) 5) 
4, Prove by induction that (1 X 2) + (2 X 3) +--+ + nv + 1) = 
n(n + 1)\(n + 2). 
3 
2 2: 
5. Prove by induction that 17+ 23 +---+n= many 
6. Prove by induction that (1 +2+4+---+nP=24+234---4 2 


(assume the result in Example 1). 
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: : 1 1 1 n 
+ rae ee ree + Neat at + - . 
7. Prove by induction that 1x2 *9x3 Per ren ere | 
; : 1 1 1 
: —— + —— + 
8. Prove by induction that ia ax 7X10 + 
ae eee 
(3n—2)X (3n+1) 3n+4+1 
9. Prove by induction that (1 X 1!) + (2 KX 2!) +--+ + (Xn) = 
we Dt — 1, 
— ntl 
10. Prove by induction that for a # 1, 7 — =lt+tat+@t---ta". 
11. Prove by induction that 17 + 3? + 5? + +++ + (Qn - 1 = 


20. 


21. 


22. 


n(2n — 1)\(2n + 1) 
3 


. If the number a, of calf births at Dr. Smith’s farm after n years obeys the 


law a, = 3a,-; — 2a,-. and in the first two years a, = 3 and a, = 7, then 
prove by induction that a, = 2"*! — 1. 


. Prove that any positive integer has a unique factorization into primes. You 


may assume the result of Example 3; you need to use the fact that if a 
prime p divides a product of positive integers, then p divides one of the 
integers. 


Write a computer program to find and print the prime factorizations of the 
first 50 integers. 


. Prove by induction that for any integer m > 0, m X n = n X m. By r X 


s, we mean the sum of r copies of s. 


. Prove by induction that for integers n = 5, 2" > n’. 
. Prove that the number of different subsets (including the null set and full 


set) of a set of n objects is 2”. 


. Prove by induction that A, A,N...NA, = A, U A, UU A, (see 


Exercise 13 in Appendix A1). 


. Prove by induction that if n distinct dice are rolled, the number of outcomes 


where the sum of the faces is an even integer equals the number of outcomes 
with an odd sum. 


Prove by induction that the number of n-digit binary sequences with an 
even number of 1s equals the number of n-digit binary sequences with an 
odd number of Is. 

Prove by induction that the sum of the cubes of three successive positive 
integers is divisible by 9. 

The cat population in a dormitory has the property that the number of cats 


in one year is equal to the sum of the number of cats in the two previous 
years. If in the first year there was one cat and if in the second year there 
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were two cats, then prove by induction that the number of cats in the nth 
year is equal to 


1 
vale + 1y/5 yn"! (3 1y/5 yr], 
23. Why cannot one prove by induction that the number of binary sequences 


of all finite lengths is finite? 


24, What is wrong with the following induction proof that all elements x,,..., 
xX, in a set of n elements are equal? 


(a) Initial step (n = 1): The set has one element x, equal to itself. 


(b) Induction step: Assume x, = x = +++ = x,-). Since also x,_; = x, by 
induction assumption (when x,-), x, are considered alone as a two- 
element set), then x} = x. = -+°* = X,-) =X. 


25. What is wrong with the following induction proof that for a # 0, a” = 1? 
(a) Initial step (n = 0): a® = 1—always true. 


n-lqn-l 
qn 


(b) Induction step: Assume a"! = 1 and now a" = =1. 


A233 A LITTLE PROBABILITY 


Historically, counting problems have been closely associated with probability. 
Indeed, any problem of the form ‘‘how many hobbits are there who ...’’ has 
the closely related form ‘‘what fraction of all hobbits ...,’? which in turn can 
be posed probabilistically as ‘‘what is the probability that a randomly chosen 
hobbit . . .?’’ The famous Pascal’s triangle for binomial coefficients was devel- 
oped by Pascal around 1650 in the process of analyzing some gambling probabil- 
ities. The probability of getting at least 7 heads on 10 flips of a fair coin, the 
probability of being dealt a five-card poker hand (from a well-shuffled deck) 
with a pair or better, and the probability of finding a faulty calculator in a 
sample of 20 machines if 5 percent of the machines from which the sample 
was drawn are faulty—all these probabilities are essentially counting problems. 

Two hundred years ago, the French mathematician Laplace first defined 
probability as follows: 


number of favorable cases 


Probability = 
ee total number of cases 


This definition corresponds to the ‘‘person in the street’s’’ intuitive idea of 
what probability is. In this text, we treat probability problems only where 
Laplace’s definition of probability applies. Implicit in this definition is the 
assumption that each case is equally likely. If the total number of cases is 
infinite, for example, all real numbers between 0 and 1, then we would not be able 
to use Laplace’s definition. A more subtle difficulty is that in some probability 
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problems, ‘‘cases’’ have to be carefully defined or else they may not all be 
equally likely, as, for example, the possible numbers of heads observed when 
a coin is flipped 10 times. To clarify this difficulty, we need to introduce a 
little of the basic terminology of probability theory. 

An experiment is a clearly defined procedure that produces one of a given 
set of outcomes. These outcomes are called elementary events and the set of 
all elementary events is called the sample space of the experiment. An event 
that is a subset of several elementary events is called a compound event. 

For example, when a single die is rolled, then obtaining a specific number, 
such as 5, is an elementary event, whereas obtaining an even number is a 
compound event. If S is the sample space of the experiment and E is an event 
in S, then Laplace’s definition of probability says that the probability of event 
E, prob(E), is 

_ NE) 
prob(£) N(S) 

In most instances, the size of S is easily determined, and so the problem 
of determining prob(£) reduces to counting the number of outcomes (elementary 
events) in event &. Returning to the die roll, the sample space of outcomes is 
S = {1, 2, 3, 4, 5, 6}. If the event E is obtaining an even number, then E = 
{2, 4, 6}, and prob(Z) = 3 = 4. 

Many experiments we discuss involve a repeated (or simultaneous) series 
of simple experiments. Each round of the simple experiment is called a trial. 
For example, the experiment of flipping a coin three times involves three . 
successive trials of the simple experiment of flipping a coin. Rolling two dice 
and recording the sum of the two values rolled involves the two simultaneous 
simple experiments of rolling a single die. In any experiment involving multiple 
trials, the elementary events are the sequences of outcomes of the simple 
experiments. If the simple experiments are performed simultaneously, we num- 
ber the simple experiments and list their outcomes in order of the experiments’ 
numbers. 

The sample space of elementary events for the experiment of flipping a 
coin three times is 


{HHH, HHT, HTH, HTT, THH, THT, TTH, TIT} (1) 
The sample space for the experiment of rolling two dice is 
S={@f)|lsis6,1sjs6} 


Let us consider more closely the experiment of rolling two dice and 
recording the sum of the two values on each die (the sample space is the set 
S listed above). 

In terms of our formal definition of events, the sum of the two values 
equaling k is not an elementary event but rather a compound event. The event, 
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the sum of the two dice equals 7, is the subset of elementary events 
S, = {(1, 6), (2, 5), (3, 4), (4 3), G, 2), 6 DI 


Thus, prob(sum = 7) = N(S,)/N(S) = 3s. Similarly, S; = {(1, 1)} and so 
prob(sum = 2) = N(S)/N(S) = 1/36. Observe that a sum of 7 is six times as 
likely as a sum of 2. If we had considered the possible sums of the two dice 
as the elementary events, we would have had a sample space $* = {2, 3, 4, 
5, 6, 7, 8, 9, 10, 11, 12} and each sum would have had probability 1/N(S*) = 
1/11—clearly a mistake. The same type of error would have been made if we 
were to regard the number of heads as the elementary events in the experiment 
of flipping three coins [see the correct sample space listed above in Eq. (1)]. 

Suppose we have a box containing 5 identical red balls and 20 identical 
black balls. Our experiment is to draw a ball. What is the sample space of 
elementary events? All we can record as an outcome is the color of the ball, 
leading to the sample space S = {red, black}. But then by Laplace’s definition of 
probability, picking a red ball will have probability 3. Although this experiment 
consists of just a single trial, we are clearly making the same sort of mistake 
that occurs when the sums of dice are treated as elementary events. The reader 
is encouraged to pause a moment and try to supply his or her own explanation 
for resolving this red—black ball paradox before reading the next paragraph. 

The logical solution is to consider not what color ball we withdraw from 
the box but where in the box we direct our hand to grasp a ball. There are 25 
different spatial positions where balls are located in the box. So, in this sense, 
there are 25 different events. We resolve this complication about ‘‘identical’’ 
objects with the following rule. 


Identical Objects Rule 


In probability problems, there are no collections of identical objects; al! objects 
are distinguishable. 


In complex experiments one should always take a moment to be sure that 
the elementary events are properly identified. 

The original motivation for studying probability was the same as used in 
this book, games of chance—rolling dice, flipping coins, and card games. 
Cardano calculated the odds for different sums of two dice in the 1500s. Around 
1600, Galileo calculated similar odds for three dice. A series of letters about 
gambling probabilities between Pascal and Fermat, written around 1650, consti- 
tute the real beginning of probability theory. Jacques Bernoulli’s Ars Conjec- 
tandi (1713) was the first systematic treatment of probability (and associated 
combinatorial methods). One hundred years later, Laplace published his epic 
treatise Theorie Analytique des Probabilities, a work containing both the defini- 
tion of probability in a finite sample space used in this book and also advanced- 
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calculus-based derivations of modern probability theory. See David, Games, 
Gods, and Gaming (Hafner, 1962) for an excellent history of probability theory. 


EXERCISES 


Mb 1. An integer between 5 and 12 inclusive is chosen at random. What is the 


2. 


4 


10. 


probability that it is even? 


In the experiment of flipping a coin three times, let E, be the compound 
event that the number of heads equals k. Determine prob(E,), for k = 0, 
1, 2, 3. 


. In the experiment of rolling two distinct dice, find the probability of the 


following events: 

(a) Both dice show the same value. 

(b) The sum of the dice is even. 

(c) The sum of the dice is the square of one of the die’s value. 

A die is rolled three times. What is the probability of having a 5 appear 
at least two times? 


Find the probability that in a randomly chosen arrangement of the letters 
h, a, t, the following occurs: 


(a) The letters are in alphabetical order. 
(b) The letter A occurs somewhere after the letter ¢ in the arrangement. 


. Find the probability that in a randomly chosen (unordered) subset of two 


numbers from the set 1, 2, 3, 4, 5 the following occurs: 
(a) The subset is {1, 2}. 
(b) 1 is not in the subset. 


. An urn contains six red balls and three black balls. If a ball is chosen, then 


returned, and a second ball chosen, what is the probability that one of the 
following is true? 


(a) Both balls are black. 
(b) One ball is black and one is red. 


. An urn contains two black balls and three red balls. If two different balls 


are successively removed, what is the probability that both balls are of the 
same color? 


. Two boys and two girls are lined up randomly in a row. What is the 


probability that the girls and boys alternate? 


Five distinct dice are rolled. What is the probability of getting at least one 
6? 
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11. If a school has 100 students of whom 50 take French, 40 take Spanish, 
and 20 take French and Spanish, then what is the probability that a randomly 
chosen student takes no language? 


*12. What is the probability that an integer between 1 and 50 inclusive is 
divisible by 3 or 4? 

13. There are three urns each with two balls: One urn has two black balls, the 
second has two red balls, and the third a black and a red ball. If an urn is 
randomly chosen and a randomly chosen ball in that urn is red, what is 
the probability that the other ball in this urn is also red? (Hint: Let the 
sample space be the set of all experimental outcomes where a red ball is 
chosen first.) 


14, What should the sample space be for the following problem: An urn has 
10 black balls and 5 red balls; 4 balls are removed but not seen (our eyes 
are shut) and then a fifth is removed and observed; now what is the 
probability that the fifth ball is red? 


15. What should the elementary events be in each of the following experiments 
so that they are equally likely? 


(a) A fair coin is tossed until two heads occur or until the coin is flipped 
10 times. 


(b) A positive integer is chosen at random. 


(c) Two positive integers are chosen at random and their difference is 
recorded. 


(d) A ball is randomly chosen from an urn with two red and two black 
balls and this ball is replaced and a ball chosen again, if necessary, 
until a red ball is obtained. 


A.4. THE PIGEONHOLE PRINCIPLE 


The pigeonhole principle is one of the most simpleminded ideas imaginable, 
and yet its generalizations involve some of the most profound and difficult 
results in all of combinatorial theory. This topic is included in these appendices 
because it does not fit naturally into any chapter of this book. More mathematical 
texts on combinatorics devote a whole chapter to Ramsey theory—for example, 
see Chapter 5 of Cohen’s Basic Techniques of Combinatorial Theory (John 
Wiley, 1978). For a thorough treatment of Ramsey theory, see Graham, Roth- 
schild, and Spencer, Ramsey Theory (John Wiley, 1990). 


Pigeonhole Principle 


If there are more pigeons than pigeonholes, then some pigeonhole must contain 
two or more pigeons. More generally, if there are more than k times as many 
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pigeons as pigeonholes, then some pigeonhole must contain at least k + 1 
pigeons. 


This principle is also called the Dirichlet drawer principle. An application 
of it is the observation that two people in New York City must have the same 
number of hairs on their heads. New York City has over 7,300,000 people and 
the average scalp contains 100,000 hairs. Indeed, the pigeonhole principle 
allows us to assert that it is theoretically possible to fill a subway car with 
about 73 New Yorkers all of whom have the same number of hairs on their 
head. 

The following three problems suggest some of the diverse generalizations 
of the pigeonhole principle. 


1. How large a set of distinct numbers between 1 and 200 is needed to assure 
that two numbers in the set have a common divisor? 


2. How large a set of distinct numbers between 1 and n is needed to assure 
that the set contains a subset of five equally spaced numbers a), a, a3, a4, 
as that is, ay — a; = A; — G) = ay — ay = as — a4? 

3. Given a positive integer k, how large a group of people is needed to assure 
that either there exists a subset of k people in the group who all know each 
other or there exists a subset of k people none of whom know each other? 


In Problem 3, it might be possible that for large values of k, there are 
groups of arbitrarily large size that do not meet the k mutual friends or k mutual 
strangers property. Such is not the case. One of the theorems of Ramsey theory 
is that there exists a finite number r, such that any group with at least r;, people 
must satisfy this property. The theorem does not say what number 7; is. Actually, 
r, has been determined for only a few small values of k. 

Most generalizations of the pigeonhole principle require special, research- 
level skills rather than the combinatorial problem-solving logic and techniques 
that this text seeks to develop. However, we include the basic pigeonhole 
principle in this text because it can occasionally be very helpful. The following 
examples illustrate an obvious and two not so obvious uses of the principle. 


Example 1 


There are 20 small towns in a region of west Texas. We want to get three 
people from one of these towns to help us with a survey of their town. If we 
go to any particular town and advertise for helpers, we know from past experi- 
ence that the chances of getting three respondents is poor. Instead, we advertise 
in a regional newspaper that reaches all 20 towns. How many responses to our 
ad do we need to assure that the set of respondents will contain three people 
from the same town? 

By the pigeonhole principle, we need more than 2 X 20 = 40 responses. = 
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Example 2 


We have 15 minicomputers and 10 printers. Every five minutes, some subset 
of the computers requests printers. How many different connections between 
various computers and printers are necessary to guarantee that if 10 (or fewer) 
computers want a printer, there will always be connections to permit each of 
these computers to use a different printer? 

Using a variant of the pigeonhole principle, we see that there must be at 
least 60 connections. Otherwise one of the 10 printers, call it printer A, would 
be connected to 5 (or fewer) computers and if none of the 5 computers connected 
to A were in the subset of 10 computers seeking printers, then printer A could 
not be used by any of these 10 computers. It happens that exactly 60 connections, 
if properly made, will solve the connection problem (see Exercise 13). m= 


Example 3 


Show that any collection of 8 positive integers whose sum is 20 has a subset 
summing to 4. 

We show that the collection must have one of the following four subsets 
with a sum of 4: (a) four 1s; (b) two 2s; (c) two 1s and a 2; (d) a 1 and a 3. 
That one of these possibilities must hold is a pigeonhole—type argument. 

If S contained no 1 or 2, then its sum would be at least 3 X 8 = 24. So 
by the pigeonhole principle, $ must contain a 1 or a 2., 

\Suppose contains a 2. We are finished if there is a second 2 or if there 
are also two Is. (Possibly there is one 1, but all other integers are at least 3, If 
there is no 1, the other 7 integers, each at least 3, must sum to 20 — 2 = 18— 
impossible by a pigeonhole principle—type argument. If there is one 1, the other 
6 integers, each at least 3, must sum to 20 — 2 — 1 = 17—again impossible. 

Suppose S$ contains at least one 1 but no 2. If there is a 3 in S, case (d) 
applies. If there are four 1s in S, case (a) applies. The alternative is no 2s and 
no 3s in S and at most three 1s. Then there are at least five other integers in 
S of size at least 4 whose sum must be less than 20—impossible. m= 


EXERCISES 


Me 1. Given a group of m women and their husbands, how many people must be 


chosen from this group of 2 people to guarantee the set contains a married 
couple? 


2. Show that at a party of 20 people, there are 2 people who have the same 
number of friends. 


3. In a round-robin tournament, show that there must be 2 sees with the 
same number of wins if no player loses all matches. 
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4. 


13. 
14. 


15. 


16. 


Given 10 French books, 20 Spanish books, 8 German books, 15 Russian 
books, and 25 Italian books, how many books must be chosen to guarantee 
there are 12 books of the same language? 


. If there are 48 different pairs of people who know each other at a party 


of 20 people, then show that some person has 4 or fewer acquaintances. 


. A professor tells three jokes in her ethics course each year. How large a 


set of jokes does the professor need in order never to repeat the exact same 
triple of jokes over a period of 12 years? 


. Show that given any set of seven distinct integers, there must exist two 


integers in this set whose sum or difference is a multiple of 10. 


. Suppose the numbers | through 10 are randomly positioned around a circle. 


Show that the sum of some set of three consecutive numbers must be at 
least 17. 


. A computer is used for 99 hours over a period of 12 days, an integral 


number of hours each day. Show that on some pair of 2 consecutive days, 
the computer was used for at least 17 hours. 


. Show that any subset of eight distinct integers between 1 and 14 contains 


a pair of integers k, / such that k divides /. 


Show that any subset of n distinct integers between 2 and 2n (n = 2) 
always contains a pair of integers with no common divisor. 


. Show that any set of 16 positive integers (not all distinct) summing to 30 


has a subset summing to n, forn = 1, 2,..., 29. 
In Example 2, find the required set of 60 computer—printer connections. 


There used to be 6 computers and 10 printers in a large computing center. 
Each computer was connected to some subset of printers. Now the 10 old 
printers are being replaced by 6 more reliable printers, but temporarily the 
computers will still be allowed to believe that there are 10 printers. Four 
dummy printers will pass on computer requests to the 6 other real printers. 
How many connections between the 4 dummy printers and the 6 real 
printers are needed to handle any set of 6 printing requests? 


Two circular disks each have 10 Os and 10 1s spaced equally around their 
edges in different orders. Show that the disks can always be superimposed 
on top of each other so that at least 10 positions have the same digit. 


A student will study basketweaving for at least an hour a day for seven 
weeks, but never more than 11 hours in any one week. Show that there is 
some period of successive days during which the student studies a total of 
exactly 20 hours. 


. Show that any sequence of n? + 1 distinct numbers contains an increasing 


subsequence of n + 1 numbers or a decreasing subsequence of n + 1 
numbers. 


. Show that at any party with at least six people, there exists either a set of 


three mutual friends or a set of three mutual strangers. 


GLOSSARY OF 
COUNTING AND GRAPH 
THEORY TERMS 


Arrangement: An arrangement is a sequence or ordered list of objects. An 
r-arrangement is an arrangement with r objects. An arrangement may or 
may not allow repetition of objects. There are P(n, r) r-arrangements 
without repetition of r objects chosen from n objects. There are n’ r- 
arrangements with repetition of r objects chosen from 7 types of objects. 


Binomial coefficient: A binomial coefficient is a coefficient in the polynomial 
expansion of a binomial expression such as (a + x)". The coefficient of x’ 


in (1 + x)" is written C(n, r) or : . This coefficient equals the number 


of distinct k-subsets of an n-set. 

Bipartite graph: A bipartite graph G = (X, Y, E) is a graph whose vertices 
are partitioned into two vertex sets, X and Y, and every edge in G joins a 
vertex in X with a vertex in Y. 


Chromatic number: The chromatic number of a graph is the smallest number 
of colors that can be used in a coloring of a graph (see Coloring a graph). 


Circuit: A circuit is a sequence of distinct vertices (x), X2, X3, . . . » Xn) Where 
x, = x,, and x; is adjacent to x;,,. A vertex may not appear more than once 
in a circuit (except for the same vertex as the starting and ending vertex). 


Coloring a graph: A coloring of a graph G assigns a color to each vertex 
so that adjacent vertices have different colors. 


Combination: A combination is a subset of objects or, equivalently, an unor- 
dered collection of objects. Objects may or may not be repeated in a 
combination. There are C(n, r) different combinations without repetition 
of r objects chosen from n objects. There are Cin +r — 1, r) different 
combinations with repetition of r objects chosen from n types of objects. 

Complete graph K, and complete bipartite graph K,,,: K, is a graph on n 
vertices with an edge joining every pair of vertices. K; is a triangle. K, is 
an edge. K,,, is a bipartite graph with m and n vertices in its two vertex 
sets and all possible edges between vertices in different sets. 
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Complementary graph or complement: The complementary graph G = 
(V, E) of a graph G = (V, E) has the same vertex set V as G does. A pair 
of vertices are joined by an edge in G if and only if they are not joined 
by an edge in G. 

Component: An unconnected graph G consists of a collection of components 
or ‘‘connected pieces.’’ A connected graph consists of a single component. 
Formally, a component of G is the largest possible connected subgraph of 
G containing some particular vertex. 


Connected graph: A graph is connected if there is a path joining any given 
pair of vertices. A directed graph is connected if it is connected when 
treated as an undirected graph (with all edge directions ignored). 


Cycle: A cycle is a sequence of consecutively linked edges whose starting 
vertex is the ending vertex and in which no edge can appear more than 
once. Unlike a circuit, a vertex can be visited any number of times in a 
cycle. 


Derangement: A derangement of a given arrangement or an ordered list of 
distinct objects is a rearrangement so that no object is in the same position 
it had in the original arrangement. 


Directed graph, directed edge: A graph is a directed graph if each edge 
(a; b) is directed, going from a to b. A directed graph may contain two 
opppositely directed edges joining two vertices, such as (a, b) and (b; a). 

Distribution: A distribution is an assignment of a given set of objects, which 
may be identical or distinct, to a set of distinct destinations. Unless explicitly 
prohibited, more than one object may go to the same destination. 


Edge cover: A set S of vertices such that every edge in a graph is incident 
to one vertex in S. 


Euler cycle, Euler trail: An Euler cycle (trail) is a cycle (trail) that contains 
all the edges in a graph. Further, it must visit each vertex at least once. 


Generating function: A generation function g(x) for a,, the number of ways 
to do some procedure with n objects, is a polynomial or power series with 
the expansion g(x) = ad) + ajx + ax? + +++ + a,x" + +++. Such a 
function is also called an ordinary generating function in contrast to an 
exponential generating function g(x), which has the form g(x) = dp + 


eee eee 

Graph: A graph G = (V, £) consists of a finite set V of vertices and a finite 
set E of edges. Each edge e = (a, b) joins two different vertices a, b (a 
# b). Also, two edges cannot join the same pair of vertices. Unless G is 
a directed graph (see Directed graph), (a, b) and (b, a) are the same edge 
(order does not matter). 

Hamilton circuit, Hamilton path: A Hamilton circuit (path) is a circuit 
(path) that contains every vertex of a graph. 
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Independent set: A set of vertices in a graph is independent if no pair of 
them are adjacent. 


Interval graph: Gis an graph interval G if there exists a collection of intervals 
on the line for which there is a one-to-one correspondence between the 
intervals and vertices of G so that two vertices are adjacent if and only if 
the corresponding intervals overlap. 


Isomorphism, isomorphic graphs: Two graphs G, and G, are isomorphic 
if there exists a matching, called an isomorphism, of vertices in G, with 
the vertices in G, so that two vertices in G, are adjacent if and only if the 
corresponding two vertices in G, are adjacent. Informally, two graphs are 
isomorphic if they are ‘‘the same graph’’ except that their vertices have’ 
different names. 


Map coloring: A map of countries is properly colored by assigning a color 
to each country so that countries with a common border get different colors. 


Matching: A matching in a bipartite graph G = (X, Y, E) is a subset of edges 
that pair off in a one-to-one fashion, some vertices in X with some verti- 
ces in Y, 


Multigraph: A multigraph is a generalized graph in which (1) multiple edges 
are allowed—two or more edges can join the same two vertices; and 
(2) loops are allowed—edges of the form (a, a). 


Network: A network is a graph, usually a directed graph, with a positive 
integer k(e) assigned to each edge e of the graph. 


Network flow: A flow is a function on the edges of a network that satisfies 
certain constraints listed at the start of Section 4.2. 


Partition: A partition of a collection of identical objects divides the objects 
into a collection of groups of various sizes. One can also speak of a partition 
of an integer 7 as a collection of positive integers that sum to n. 


Path: A path is a sequence of distinct vertices (x1, %2, %3,..., Xn) such that 
x; is adjacent to x;,;. A vertex may not appear more than once in a path. 


Permutation: A permutation of a set or sequence of objects is an arrangement 
of the set or sequence, normally with no repetition allowed. An r-permuta- 
tion is an r-arrangement of r objects chosen from the set or sequence. 


Planar graph: A graph is planar if there exists a way to draw it on a sheet 
of paper so that no edges cross. A plane graph is a planar graph drawn 
so that no edges cross. 


Recurrence relation: A recurrence relation is an equation of the form a, = 
2a,-; + 3a,-2, in which a, is the number of ways to do some procedure 
with k objects. 


Selection: A selection is an unordered collection of objects. Objects may or 
may not be repeated in a selection. There are C(n, k) different selections 
without repetition of k objects chosen from n objects. There are C(n + k 
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— 1, k) different selections with repetition of k objects chosen from n types 
of objects. 


Subgraph: A subgraph is a graph that is contained in another graph. If G’ 
= (V’, E') is a subgraph of G = (V, E), then V’ C V and E' CE. 

Trail: A trail is a sequence of consecutively linked edges in which no edge 
can appear more than once. Unlike a path, a vertex can be visited any 
number of times in a trail. 


Tree: A tree is a graph with a special vertex called a root and for each vertex 
x, other than the root, there is a unique path from the root to x. An undirected 
tree is characterized as a connected graph with no circuits. For tree-related 
terms, see the following Subglossary of Tree Terminology. 


Subglossary of Tree Terminology 


Ancestors: Ancestors of vertex x are the set of vertices on the path from the 
root to x. 


Backtracking search: See Depth-first search. 


Balanced tree: A tree with all leaves at level ) and h — 1, where h is the 
height of the tree. 


Binary tree: A tree in which each internal vertex has exactly two children. 


Breath-first search: From the root, find all vertices z adjacent to the root, 
then all vertices adjacent to one of the zs, and so on. 


Child: Children of vertex x are vertices y with an edge (x, y) from x to y. 


Depth-first search: Also known as backtracking search, builds a path from 
the root as far as possible; one backtracks from the current vertex when 
the path cannot be extended to a previously unvisited vertex and backs up 
the current path until a vertex is found at which a side path may be 
constructed to a new vertex. 


Descendant: Descendants of vertex x are the set of vertices z whose path 
from the root to z contains x. Alternatively, descendants of x are the set 
of vertices in the subtree rooted at x. 

Height: The largest level number in a tree. 

Inorder traversal: In a binary tree, starting from the root, this traversal 
recursively lists the vertices of the left subtree of a particular vertex x, then 
x, and then the vertices in the right subtree of x. 

Internal vertex: A vertex with children; internal vertex and parent are equiva- 
lent terms. 

Leaf: A vertex with no children; a leaf’s only incident edge comes in from 
its parent. 

Level or level number: The length of the path from the root to a given 


vertex; €.g., level 2 of a tree consists of all vertices whose path from the 
root has length 2. 
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m-ary tree: A tree in which all internal vertices have m children. 


Parent: The parent of vertex x is the unique vertex z with an edge (z, x) 
from z to x. 


Postorder traversal: Lists vertices in the order they are last encountered in 
a depth-first search of a tree. 


Preorder traversal: Lists vertices in the order they are first encountered in 
a depth-first search of a tree. 


Root: The special vertex in a tree with a unique path to any other specified 
vertex; in a directed or rooted tree, the root is the unique vertex at level 0 
that has no parent. 


Rooted tree: A directed tree with all edges directed away from the root. 
Sibling: Siblings of vertex x are those vertices with the same parent as x. 


Spanning tree: A tree that is a subgraph of a connected graph and that 
contains all vertices of the graph. 


Subtree: A connected subgraph of a tree. 
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SOLUTIONS TO ODD- 
NUMBERED PROBLEMS 


CHAPTER ONE SOLUTIONS 


Section 1.1 
1. (a) F (b) 2 (C, D), (C, E), 


D 82 c 
(c) yes, several routes. 3. (a) A 5-circuit. 5. (a) many possibilities, 
(b) min = 4, several possibilities, e.g., 7. (a) ©, 


(b) 4. 9. (a) no others, (b) {b, c}, {b, d}, {d, e}. 

11. (a) (a, b), (a, g) and (c, d), (d, e) and (c, d), (e, f) and (d, e), (e, f), 
(b) {b, g}, {b, ft, {c, fF}, {c, ef, fd, f}, (©) 3 (the edges incident to any vertex). 
13. (a) three matchings (see Exercise 14), 

(b) A and C each can fill only job b. 15. (a) {a-b, c-d} or {a-c, b-d}, 

(b) no possible (odd number of vertices), 

(c) a, c, e collectively must be matched with just b and e. 

19. (a) vertex = variety of chipmunk, if A splits into B and C, then make 
edges (A; B) and (A; C), (b) 7 splits. 

21. minimal block surveillance- 6, minimal corner surveillance- 3. 

23. (a) 5: squares (2, 4), (3, 4), (4, 4), (5, 4), (8, 4), (b) 10. 

25. (a) (i) 2, (ii) 2, (iii) 5, (b) W@W {a, a}, Gi) {b, c, e, f}, (iii) {DO, STORE}. 


27. only Figures 1.2 and 1.4. 29. (a) ‘ b a 
(b) a ua d (c) a z d 
c f i @ 
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31. 

33. (b) 4-circuit a-b-c-d-a: 4 _) € d 
—— d 

37. reduce C by 2 and B by 1. 39. (a) S (b) not 

possible. a \, 


aN 
JN, 
AS 


Section 1.2 


1 e e e e oe e e 
ig Plus complements 
of first 4 graphs 
e e o—e oe 


3. The degree-3 vertices in the right graph form a 4-circuit; in the left graph 
they do not. 5. (a) yes, a-1, b-5, c-4, d-3, e-2, f-6, 

(b) yes, a-6, b-1, c-3, d-5, e-2, f-4, 

(c) yes, a-3, b-4, c-7, d-1, e-6, f-8, g-2, h-5, i-9, 

(d) no, right graph has one more edge, 

(e) no, subgraphs of vertices of degree 3 do not match, 

(f) yes, a-1, b-2, c-3, etc. 7. Graphs 1-6, 13-18, 31-36, 37-42 mutually 
isomorphic, 7-12, 19-24, 25-30 isomorphic. 

9. no, building on the isomorphism in Example 2, b — c but 5 < 2. 

13. (a) two, (b) four, (c) one. 15. four in each. 


Section 1.3 


1. (a) 12, (b) 9, (ce) 8 or 10 or 20 or 40. 3.12. 

5. if v vertices, then e = up edges (where jv is an integer since p is 
odd). 7. sum of in-degrees (or out-degrees) = number of edges, since 
sum counts each edge once. 11. (a) no, (b) no, (ce) yes. 
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Section 1.4 
1. (a) 


3. (a) no, delete edges (c, d), (i, g) bipartition in K33: {a, h, j}, {b, e, f}, 
(b) no, delete (b, c), (e, f): {a, e, f}, {b, c, d}, 

(c) no, delete (b, /), (e, g): {a, d, h}, {c, f, i}, (d) no, many possible K3,, 
(e) no, delete (a, b), (b, c), (d, e), Gf, g): {a, d, e}, {c, f, gh, 

(f) no, delete (d, e): {a, c, e}, {b, f, zg}, (g) delete (f, g): {a, c, e}, {b, d, hh, 
(h) no, delete (a, b), (b, e), (c, f): {a, e, g}, {d, f, hy. 5. (a) n= 4, 

(b) rors S 2. 7. (a) possible, r = 8, (b) possible, e = 12, 

(c) not possible, v = 6, (d) not possible, r = 10, (e) possible, v = 7, 
(f) possible, e = 12, r = 8, (g) not possible (parity violation), 

(h) possible, e = 10, v = 7. 

9. (a) circuit length = sum of number of boundary edges of R, and R, — 2; 
(b) circuit length = sum of number of boundary edges of each enclosed 
region minus 2 X (number of edges interior to circuit). 

11. (a) K33 and K; are critical nonplanar. 13. (a) r=e-vtetl, 
(b) Using (a), corollary becomes e = 3v — 3c — 3 (S 3v — 6). 

17. If false, deg = 5 and so 5v = sum of degrees = 2e = 2(3V — 6) = 

6v — 12, that is, 5v = 6v — 12 or 12 S v—impossible. 

21. (a) If not triangular, an interior chord can be drawn; 

(b) 3r = sum of boundaries of regions = 2e > fe =r=e-vi2> 

e = 3v — 6 = 3n — 6 andr = $e = 3 (3n — 6). 23. v=p+ 2l,e= 
42Xdeg = 2p + 3, answer =r—-l=e-vt+t1l=pt+/+1. 

25. (a) degree of vertices in K; is 4 => degree of vertices in L(K;) is 2 X 
(4 — 1) = 6; L(Ks) has v = 10 and e = 32.deg = 3v = 30 > ee ¥ 3V — 6; 
(b) 


Supplement I 


1. vertex = committee, edge = committee overlap (person). Graph is K; 
with e = }Xdeg = 6v) = 21. 3. n = 12. 5. (a) yes, trace out any 
sequence of edges and eventually a vertex z will be repeated, between first 


and second visit to z a circuit is formed, (b) no, e.g., A\/h 
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7. use same argument as in 5(a). 9. Vertices in different components of 
G are directly adjacent in G, vertices in same component are joined in G by 
a path of length 2 via any vertex in other component of G. 11. if: a, b 
in different components (see Exer. 10) of G-x and so x’s removal 
disconnects G, only if: pick a, b in different components of G-x. 13. if: 
suppose not strongly connected with no path from a to b—let V, consist of 
a and all vertices that can be reached by a directed path from a, V2 is other 
vertices, only if: obvious. 15. (a) yes, see Exer. 7 in Section 1.2, 

(b) no, odd number of vertices of odd degree, 

(c) no; e = $2deg = 13 Se ¥ 3V — 6. 

17. (a) Repeatedly remove side circuits until trail from x to y has no 


repeated vertices. (b) | | (c) similar argument to part (a). 


y 
(d) If odd number of edges of cycle are partitioned into circuits [part (d)], 
then some circuit must have odd number of edges. 21. Most edges in a 
bipartite n-vertex graph occur if n/2 vertices in each part and all possible 
joining edges—K,.,,»—which has e = (gn)’ = 4n’. 23. (a) 5-circuit or 
3-edge path. (b) By hint, if G has n vertices, number of edges in G = 
3(number of edges in K,) = 3[3n(n — 1)] = in(n — 1). Since n and n — 1 
are not each divisible by 2, then either n or n — 1 must be divisible by 4, 
len = 4korn = 4k + 1. 25. if: obvious, only if: let x, be vertex 
with 0 out-degree (if no such vertex, there is a directed circuit—Exer. 5(a), 
move x, from graph and let x,-, be vertex with 0 out-degree in remaining 
graph, continue indexing in this fashion. 
27. (a) (b) each edge of K,, is incident to 


n — 2 other edges at each end vertex, 2(n — 2) incidences in all, (¢) can 
be no vertices of degree 0 or 1 and so all degrees = 2; since e = $ddeg, 
then to have e = v (so that G and L(G) have same number of vertices), all 


deg = 2 => G is any circuit. 29. >< 
31. 5 ZX 33. Consider a vertex y that 
1 7 
; é Ve 
6 
is 3 ‘ ey, 
3 
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beat x, [i.e., (y? x)]; if y beat every competitor that beat x, then y would 
have a greater score than x—not possible—and so for some w, (x, w) and 
(Ww; y). 35. Sketch of proof: show that each vertex in (C; U C,) — 
(C, M C,) has even degree and then repeatedly trace a path (without 
repeating any edge) until a vertex x is revisited and remove the circuit 
formed between the first and second visit to x. 


CHAPTER TWO SOLUTIONS 
Section 2.1 


1. (a) many possibilities, (b) many possibilities with b and fas end vertices. 
3. Yes, make degree even at all other vertices and degree will also have to 
be even at new vertex (or else graph would have only one odd-degree 
vertex, violating Corollary in Section 1.3). 

5. (a) No, once bridge crossed there is no way back to starting vertex, 

(b) many possibilities, e.g., a 10-edge path. 7. An isolated vertex added 
to a connected graph with even degrees now has a Euler cycle but is not 
connected. 9. Build a graph with a vertex for each racer and an edge for 
each race; a Euler trail corresponds to a sequence of races in which each 
racer is in two consecutive races; this graph has the desired Euler trail 
because only A and F have odd degree. 11. A set of deadheading edges 
must have one edge at each odd-degree vertex; joining these edges at odd- 
degree vertices together (without changing the parity of the degree of other 
vertices) requires a set of paths. 13. A directed graph has a Euler trail if 
and only if at all but two vertices indegree = outdegree and at those two, 
indegree and outdegree differ by one. Proof: add an extra edge so that 
indegree = outdegree at two unbalanced vertices and resulting graph has 
Euler cycle; remove added edge yielding desired Euler trail. — 

15. no such Euler cycle containing all vertices. 17. (a) yes, (b) if at 
every stage graph of remaining edges is connected, then just before using 
last edge E and being forced to stop at starting point, FE is only edge 
remaining and once taken there are no remaining edges, (c) applies to 
Euler trails. 19. (a) (b) concatenating 


the first digits of each node on an Euler cycle produces desired sequence, 
(c) many possibilities, e.g., cycle 00-00-01-11-11-10-01 (-00) produces 
00011101. 
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Section 2.2 


1. many possibilities, e.g., (a) <> , (b) <] 


3. a-g-c-b-f-e-i-k-h-d-j-a. 5. path: m-b-c-d-e-q-a-l-k-j-o-p-n-f-g-h-i, no 
circuit: by symmetry at p, delete any edge, choose p-m, at m and c rule 1 
forces subcircuit m-b-c-d-m. 

7. (a) rule 1 at f, h, j forces subcircuit e-f-g-h-i-j-e, (b) by symmetry at p, 
use m-p-n and delete p-o forcing k-o-i, at m and n, if both use edge going 
up (to g) then subcircuit p-m-g-n-p, if both m, n use edge down then 
subcircuit p-m-k-o-i-n-p, so by symmetry use m-g and n-i, delete i-h, i-c, i-j 
forcing g-h-b-c-d-j-k, plus k-o-i-n-p-m-g, yielding a subcircuit. 

9. (a) Hamilton circuit alternates between red and blue vertices and so must 
have equal numbers of each, (b) follows from part (a), (ce) (i), (ii) have 
odd number of vertices, (iii) has 9 vertices in one part and 7 vertices in 
other part of bipartition. 


11. (a) (b) many 


Edges between vertices at 
corresponding positions of 
the two cubes 


possibilities: 1-0000, 2-0001, 3-0011, 4-0010, 5-0110, 6-0111, 7-0101, 
8-0100, 9-1100, 10-1101, 11-1111, 12-1110, 13-1010, 14-1011, 15-1001, 
16-1000. 13. Many solutions but very tedious; the following heuristic 
works: at each stage, look at all possible squares (not yet visited) a knight’s 
move from current square and move to a square with the minimum number 
of possible squares for the next move. 15. (a) rook: starting at upper 
left corner, go down first column, square by square, at bottom of first 
column move right to bottom of second column, move up second column, 
square by square, continue going down and up successive columns finishing 
at top square of right column from which one moves back to top square of 
left column, king: similar to rook, except in columns 2, 3,...,  — 1, avoid 
top square, when top square of right column reached, move left along top 
squares of each column to return to top left square, (b) rook: not possible 
(variation on reasoning in Exer. 14), king: (1, 1)-(1, 2)-(Q, 2)-(1, 3)... (2, 
n), now go left and right covering rows 3 through n avoiding left square in 
each row except row 2, finishing at (1, 7) and now move up first column to 
starting square. 17. (a) n!, (b) form the Hamilton circuits by placing 
vertices in a circle: first circuit formed by joining consecutive vertices, 


0000 0001 
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second circuit formed by joining vertices 2 positions apart (one intervening 
vertex), third circuit formed by joining vertices 3 positions apart, and so on 
(no subcircuits because n prime); (c) form complete graph with professors 
as vertices, answer: 8 days, using part (b) to form 8 Hamilton circuits out of 
the edges of Ky. 19. if x, y nonadjacent, direct all edges inward at both 
x and y, now x and y can only be on a Hamilton path if they are the first 
vertex on the path—cannot be two first vertices. 


21. many possibilities, e.g., >< 


Section 2.3 


_ 1. (a). 3, has K;, (b) 4, an attempt to 3-color outer 6 vertices gives b, d, f 
different colors so that c requires a fourth color, (c) 2, (d) 4, an attempt 
to 3-color vertices clockwise around the circle starting at a forces j to have 
same color asa, (e) 2, (f) 2, (g) 4,4, 0, e, g forma K,, (h) 3, outer 
pentagon (odd-length circuit) cannot be 2-colored, (i) 4, an attempt to 
3-color the sequence of vertices a, e, b, f, c, g, d forces d to have same 
color as a, (j) 4, a 3-coloring forces a and h to have different colors, 
similarly for i and p so that a, h, i, p act like a Ky. 3. (b), (i), Cj). 

5. (a) {a, b, ch, {d, e, f}, {g, A}, (b) {a, b, eb, {c, a}, th gh, 

(c) {a, b, c}, {d, e, fF}, (9, h, if (i, kK: 7. (a) (i) h — 4K + 6K — 3k, 
(ii) K(k — 1(k — 2)(k — 3), (iii) K(k — 1K — 2), (b) smallest 
nonnegative integer for which P,(G) is positive is chromatic number of G. 
9. No, Nevada (or Kentucky or West Virginia) and its neighboring states 
have duals that form odd-length wheels. 11. 3. 

13. vertices = banquets, edges = require a common room, colors = days. 
15. vertices = mathematicians, edges = in conflict, colors = hotels. 

17. (a) graph of contestants and matches is K,, colors = days, edge 
coloring of K,, partitions edges (matches) into collections of matches that 
can be played on a common day, (b) n = 4, requires 3 days (colors), each 
day pair player a with one of the other players and then pair off the 
remaining two players, n = 5, requires 5 days (10 matches to be played and 
at most 2 matches—4 players can play one day), draw K; in standard form 
as pentagon with all chords, give different color to each edge of outer 
pentagon, give same color to each chord as color of outer edge parallel to it 
(this method extends to K, for any odd 7). 


Section 2.4 


1. Proceed as in Theorem 5 using induction and the fact that there is a 
vertex x of degree 5, but since 6 colors are available, x is immediately colored 
with a color different from those used by its 5 neighbors. 3. Either x, y have 
the same color—which is forced when x, y, are coalesced to form G{,—or 

they have different colors—which is forced when edge (x, y) is added to 
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form G;,. 5. (a) 3-color one triangle and then extend by successively 
coloring a vertex that lies on a triangle with two previously colored 
vertices, (b) process in part (a) yields unique coloring. 7. (a) If not 


connected, then k colors are needed to color one of G’s components and 
removing a vertex from another component will not reduce x(G), (b) if 
deg(x) = k — 2, then if G — x could be k — 1 colored, also G could be 

k — 1 colored by giving x one of the k — 1 colors not used by one of x’s k — 2 
(or less) neighbors, (c) if x disconnects so that G — x has components G’ 
and G” then one of G’ U x or G” U x requires k colors and removing a 

vertex from the other component will not reduce x(G). 9. (a) forn = 1, 
x(G) + x(G) = 2; assume for n —_1 and consider an n-vertex graph G; by 
induction we can color G — x and G — x, for any given vertex x, with a total of 
ncolors for both graphs (possibly fewer); x has a total of n — 1 edges in G and 
G and so is not adjacent to one of the color classes in one of G or G and can 
be added to that color class, although in the other graph x may require an 
additional color—for a total of n + 1, as required, (b) x(G)) = size of 
largest complete subgraph in G = q, size of largest independent set in G, and 
by Exer. 6(a), x(G)g =n, (c) square both sides of inequality, new inequality 
follows immediately from (b) and the fact that a? + b’ = 2ab. 11. (ke? - 
6k + 8) = 0 when k = 4and so P,(G) = 0 for k = 4, but the Four Color Theorem 
says that any planar graph can be 4-colored (i.e., has a positive number of 
4-colorings). 13. if: label with numbers that are the length of the longest 
path starting at the vertex, adjacent vertices must have different length longest 
paths since edge (x; y) implies that x’s longest path will be at least one greater 
than y’s longest path length, only if: let colors be numbers 0, 1,...,& — 1 and 
direct edges from larger to smaller numbers. 15. Successively color 
intervals (i.e., vertices) from left to right according to the order of their left 
endpoints, giving each interval a color unused by previously colored interval; 
most colors needed will be maximum number of intervals overlapping at one 
point = size of largest complete subgraph. 


CHAPTER THREE SOLUTIONS 
Section 3.1 


ce Aiea) OM eee Oe 2 
=< NIN NZ ELI oe 


3. no odd circuits => bipartite (Theorem 2 of Sect. 1.3) = 2-colorable. 

5. (a) no circuits means unique path from root to each vertex (two paths 
would yield a circuit), (b) since G is connected, a subset of edges can be 
chosen to form a tree containing all vertices of G, but this subgraph has one 
fewer edges than vertices (by Theorem 1) and if G contained other edges, 
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besides those in this tree, it would have as many vertices as edges, (c) if 
G has a circuit, removal of an edge on circuit does not disconnect G > G 
has no circuits; now use part (a). 7. Start at any vertex and trace a trail 
(no repeated edges); since no vertex is ever visited twice (if so, circuit 
would result) and graph is finite, trail must end at a vertex x of degree 1; 
now start trail-building again at x to get a second vertex of degree 1. 

9. height h = each path to a leaf passes through at most A internal 
vertices with a choice of m children to go to at each internal vertex, for a 
total of at most m" paths to a leaf. 11. i/n = [by Corollary part (a)] 
i(mi + 1) & i/mi = 1/m. 


13. Unbalancing a tree: re makes sum of level numbers 


larger and so smallest sum occurs when binary tree is balanced, in which 
case each level number is | log,/| or |log,/] + 1, and sum of level numbers 


is at least /[log,/|. 15. k choices for root, color parents before children 
and then each vertex when colored has k — 1 choices (any color except 
color of parent). 17. (a) internal vertices are +s heft 
. + 9 
KN 
3 1 2 5 


(b) | log,100 = 7, 19. leaves = letters, each leaf = n-digit binary 
sequence = 2" letters. 21. (a) 24, (b) 16, (c) 12, (d) 9 
tournaments (original and 8 losers’ tournaments). 


23. (a) | Hoes oe | = [login + I)J—1, (b) 


D E 


25. (a) 12|3T (b) first weighing is either one coin 


12 A\T 142 


1L 3H 2L 4L All 4H 2H 3L 1H 
fair 
on either side or two coins on either side, in either case some outcome has 
more than 3 possibilities that must be distinguished in the one additional 
weighing (impossible, for example with one on each side, there are 5 
possibilities if the scales balance). 
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Section 3.2 


1. (a) several depth-first search paths, (b) 4. 
3. (04) (2, 4) (6, 4) 


(7, 3) 


: (6, 0) 
5. (0, 0)-(0, 4)-(4, 0)-(4, 4)—now 2 quarts in 10-quart pitcher. 
7. right-hand-wall rule is same as depth-first search, which takes leftmost 


branch at every intersection (corner). 


9, a 11. Start with wives ABC and 


; yer 
aT | 
I TF 
go 
| a 


aan 


corresponding husbands abc on near shore; ab across and a return yields 
ABCac; ac across and a return yields ABCa/bc; BC across and Bb return 
yields ABab/Cc; AB across and c return yields abc/ABC;, ac across, c return, 
then bc across completes crossing. 13. (a) Start with M|M.M3C,C,C; on 
near shore; CC) across and C, return yields M,M,M3C,C3/C2; C:C3 across 
and C; return yields M,M.M;C,/C,C;; MM; across and M,C, return yields 
M,M,C,C2/M;C3; MM, across and C; return yields C,C,C;/M,M2M3; C.C3 
across, C; return, and C,C; across completes crossing (b) Let C, be the 
one cannibal that can row. Begin crossings as in part (a), then when 
M,.M,C,C,/M;C; on near shore, perform round trip of M,C, across and M;C; 
return to obtain M)>M3C.C,/M,C,. Now continue as in part (a) with indices 1 
and 3 interchanged. 


Section 3.3 


-1. (a) any 8-vertex path, (b) many possibilities, e.g., a-b-d-c-h-k-i-g-e-f-j, 
(c) many possibilities, e.g., a-b-c-d-e-f-g-h. 3. all trees on 5 vertices 
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[see solution to Exer. 1(b) in Section 3.1], (b) all trees on 4 vertices [see 
solution to Exer. 1(a) in Section 3.1], 


IA 


interchanged in last tree. 5. 4 components, x17, X19, X23 isolated vertices, 
depth-first spanning tree for other component has path x2-X4-X6-X3-Xo-X12-Xe-X10- 
Xs-Xi5-Xao-Xi4-X16-Xis-X2e-X24-X76-X13, Plus edges from xy to x5, from xj4 to x7 to 
Xx, and from x to x). 
7. (a) 59: path L-a-c-d-h-f-g-i-k-W plus edges (b, d), (d, e), (8, J), (& 1), (k, m), 
(b) 60: replace (k, W) by (m,W), (ce) L, j (other possibilities), 
(d) modify part (b) by deleting (a, c), (g, i), and adding (a, 5), (g, k). 
9. (a,b) 39: path N-b-c-d-e-g-j-m-R plus edges (d, h), (e, f), (f 1), UV, k). 

11. 13. (a) If not all vertices reached, some reached 


, places of d and c can be 


vertex would have an edge to an unreached vertex, but a depth-first search 


would use such an edge, (b) immediate. 15. If C has no edge of a 
spanning tree T, removal of C could not disconnect graph (spanning tree’s 
edges connected graph). 17. If more than one circuit results, then some 


edge e is on one circuit and not on other circuit, but removal of e leaves a 
circuit—impossible since n — 1-edge spanning subgraph of an n-vertex 
graph must be a tree [see solution of Exer. 5(b) of Section 3.1]. 

19. Prim’s algorithm gives minimal spanning tree and it will proceed in a 
unique fashion if all edges have distinct costs. 

21. Modification: add longest edge that does not form circuit. 

23. See solution of Exer. 17. 25. (a) If all edges of shortest length do 
not form circuit and they are not all in 7’, then add an omitted one and 
remove a longer edge in the resulting tree (like in proof of Prim’s algorithm) 
to obtain a shorter minimal spanning tree—impossible, 

(b) same reasoning as in part (a), (c) if: part (b) is property of minimal 
spanning tree used to prove validity of Prim’s algorithm; only if: verified in 
part (b). 


Section 3.4 

1. Cost is 11: 1-3-2-4-1. 5. Cost is 14: 7\-T5-Ty-T3-T,-T}. 
7. (a) 1-4-3-2, (b) 1-4-5-3-2-6-1, (©) 2-3-5-1-4-2. 9.21 
22 
21 


NNN 
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Section 3.5 


1. Outcomes are ordered lists. 3. A binary comparison tree has n! 
outcomes or leaves (as noted at the beginning of this section); so average 
number of comparisons = average leaf level = (by Exer. 13 in Section 3.1) 
logon! = n logyn. 5. 1,2,..., 7. 7. If the initial heap is balanced 
[as described in Exer. 6(a)], then largest level number is log.n and the 
number of comparisons to adjust the heap each time the root is removed will 
equal the largest level number, i.e., logan; for n iterations of removing the 
root and readjusting the heap, there will be nlog.n comparisons; constructing 
initial heap is similar. 11. (b) O(nlog,n). 


CHAPTER FOUR SOLUTIONS 


Section 4.1 


1. length 14: c-d-h-k-j-m. 3. (a) 31: L-c-d-f-g-k-W, 

(b) 32: L-b-h-j-m-W, (ce) 13: L-a-c-d-f-g-k-W, (d) L-c-d-f-g-k-W, 2 paths. 
5. (a) 5: L-b-h-j-m-W, (b) 6: L-c-d-f-g-k-W, (c) 6: L-c-e-g-j-m-W. 

7. If algorithm has found shortest paths to all vertices =m units from a, 
then a vertex x will be distance m + 1 from a if and only if the length 
k(y, x) from a vertex y (y is closer to a than x) plus the distance d(y) of y 
from a equals m + 1—this is exactly the test that the algorithm performs. 
9. Graph with negative-length circuit has shortest path of unboundedly 
negative length (go around and around the negative circuit); algorithm 
relabels vertices and never terminates. 

11. Define a tree by letting the first label of a vertex be its parent. 

13. The length of a sequence of strings is the physical length of that set of 
strings; when two ends pulled apart, they will be stopped by the shortest- 
length connection between the two ends, i.e., the shortest path. 


Section 4.2 


1. max flow = 13, P = {a, b, c}. 3. max flow = 50, P = {f, z}. 
5. (a) max flow = 13, P = {a, b, c}, 
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(c) «a 2 f 7_k 3? 8, 7. Set capacities of edges out 


of a and into z equal to 100 (equivalent to unlimited flow), max flow = 150, 
P = {a, b, c, d}. 9. 5, build paths by choosing leftmost unused edge 
leaving each vertex. 11. (a) 3 G edges leaving L), (b) 15, quintuple 
flow in part (a). 13. See Exer. 12 for modeling the vertex capacity 
constraint; max flow = 40, P = {f, f,, go. z}- 15. max flow = 2100, 
route 400 on dp-d>-23-24, 300 On ay-Cy-24, 400 on ay-b1-23-24, 200 On ay-b,-c2- 
d3-24, 400 on Ap-Q,-A3-Z4, 400 on Ay-A,-D 24. 17. max flow = 36. 

19. (a) algorithm tries to reduce flow in incoming edge before adding 

flow to outgoing edge, (b) yes. 21. Define tree with a vertex’s parent 
being its first label. 23. (a) put flow of | in the two edges between c 
and e; other edges no flow, (b) take flow in (a); and add a 2-unit flow 
path a-d-f-z. 27. If each flow path crosses a cut once, then the capacity 
of the cut = sum of values of such flow paths =. value of flow. 

29. (a) This is max flow—min cut theorem for model in Example 6, 

(b) make edge capacities 1 and build vertex constraints of 1 (see Exer. 12), 
result is max flow—min cut theorem for this network. 31. One needs to 
build an initial feasible flow (requires at least one edge entering and one 
edge leaving each vertex, except a and z), then use same algorithm to 
reduce, instead of increase flow along a flow path. 33. (a) In step 2a of 
augmenting flow algorithm, incoming flow cannot be reduced below its 
lower bound value; otherwise use same algorithm. 


Section 4.3 


1. (a) several possibilities, see (b), (b) using pairings A-G, Lo-J, first 
labels are (except for a, all second labels are 1): Bo-a*t, F-Bo*, G-Bo*, 
C-F~,A-G", J-C*, Bi-A*, Lo-J~, D-Bi-, La-Lo*, z-La*, new matching A-G, 
D-Bi, Lo-La, Bo-F, C-J. 3. Give edges from a and into z the appropriate 
supplies and demands. Middle edges still ©, one solution: A-Bi (3 dates), 
D-Bi (2), Bo-F (1), Lo-F (2), C-F (1), A-G (1), Bo-G (2), C-J (3), Lo-J (2), 
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D-La (3). 5. no, schools with demand of 7 Ph.D.s can hire at most 6 
(one from each university). 9. (a,b) Make complete bipartite graph, 
each vertex of degree n; by Example 3 there is a pairing for the first night; 
remove these edges; now again by Example 3, there is a pairing for the 
second night; continue in this fashion. 11. Necessary and sufficient 
condition: for any set S of vertices, |S| = |s(S)|, where s(S) is the set of 
successors of vertices in S (vertices with an edge coming in from a vertex in 
S); this condition guarantees a complete matching in hinted bipartite graph, 
which corresponds to a set of edges in original graph with one edge out of 
and one edge into each vertex. 13. For each left-side vertex, x, form a 
set of x’s (right-side) neighbors. 


CHAPTER FIVE SOLUTIONS 


Section 5.1 


1. (a) 12,56, (b) 4, 15. 3. : xX 10 X 6 X 4, 

5. 26', 26 X 25 X 24 X 23. 7. (a) 8+7+5, (b)8X7X5, 
Cree ee ee ey wentee ae er ee 
9. (a) 4 X 47 = 188, (b) (1 X 48) + (12 X 47). 11. 12. 

13. (a) 9 X 10%, (b) 5 X9 X 10°, (c) 94 + (4 X 8 X 93), 

(d) see Supplement at end of Chapter 5. 15. 52 X 48/52 X 51. 
17. (a) 10X99 X 8,10 X9X 8", (b)O9X8XxKI9X8 xX 8, 
() [dl x9xX8xX1I+9OXxX8X7XDIPX9X8X8B. 

19. (6 X 5 X 4) — (5 X 4 X 3). 

21. (6X5 xX 4)- (4X3 X 2)or(2X 3X4 xX 3) 4+ (3X2 x 4), 
23. 4 X 3 X 87/10'—first position 8 and then position 9. 

25. (a) 7°, (b) 20 X 19 x 58, 27. See Supplement at end of Chapter 5. 
29. See Supplement at end of Chapter 5. 31. 2 X 25/50 X 49. 
33. [(3 X 3 X 2) + (2 X 3!)]/6’—pick the possible smallest value 

(3 choices), count ways of picking middle value and then arranging them 
(3 X 2 possibilities when middle value same as largest or smallest). 
35. 2. 37. 16 X (8 X 7/2), n X m X (8 X 7/2). 

39. {[28 X (64 — 22) (Queen on edge of board)] + [20 x (64 — 24) 
(Queen one away from edge of board)] + [12 X (64 — 26)] + 

[4 x (64 — 28)]}/2, 

41. 2'°—each friend is or is not in the subset. 43.2X3X6. 


Section 5.2 


1.52! 3. P(15,9). 5. P(8, 5). ~— 7. C(10, 5), C(8, 3). 

9. {C(10, 8) + C(10, 9) + C10, 10)}/2". 1. Cin, 9)2"-. 

13. (a) C(4, 2) x {C(6, 4) + C(6, 5) + C(6, 6)} + [C(4, 3) x CO, 6)], 
(b) {(C(4, 2) X C(, 2)] — B X 5)} + {IC(4, 1) X CO, 3)] — CG, 2)} + 
C6, 4). 15.3! 6! xX 8!X 5! 17. C21, 5) X 10!; 2 x 51/101. 
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19. (a) 2 X 51/6! = 1/3—treat ab or ba as a single letter, (b) C(6, 2) X 
4!/6!—pick pair of positions for a, b (with a going in the first position); by 
symmetry, answer is seen to be 1/2. 
21. (a) C(n, 3), (b) C(n — m, 3) + [m X Cn — m, 2)} 23. 12 
25. (a) 1-vote 1/6, 2-vote 1/2, (b) 1-vote 1/6, 2-vote 1/3, 
(c) 1-vote 468/7!, 2-vote 1056/7!, (d) 1-vote 396/7!, 2-vote 864/7!. 
27. C(30, 5)'"/C(360, 60). 29. (a) {[(3 X 4) + (2 X 5)] X 8!}/10!, 
(b) {1 X 9 X 8!) + (8 X 8 X 8ID}/I10!. 31. (a) {[C(4, 1)* x CG6, 1] 
+ [4 x C(4, 2) x C4, IP IVC(S2, 5), 
Me {C(26, 5) + C(13, 19°C (26, 3) + C(13, 2)’ X C(26, 1)/C(S2, 5). 

3. See Supplement at end of Chapter 5. 
ae {C(30, 2) X C(28, 2) X C(26, 2) X C(24, 2)}/4!. 
37. 4 X 3 X P(8, 6). 39. (a) See Supplement at end of Chapter 5. 
41. C(64, 8) X C(56, 8), C(B2, 4) X C(28, 4). 
43. See Supplement at end of Chapter 5. 45. C(6, 2) X {2° — 2}/6. 
47. See Supplement at end of Chapter 5. 
49. (a) C(10, 2) X C(k — 10, 18)/C(k, 20), (b) k = 100. 
51. (a) 4!, (b) 4 X 3 X {4 + 6}. 53. (a) P(C(5, 3), 3), P(C(n, 3), 3). 
55. See Supplement at end of Chapter 5. 
57. (a) C(45, 3) + C(45, 2) X C(45, 1), (b) [3 < C(30, 3) (three integers 
with same value mod 3)] + C(30, 1)° (each integer with different value mod 
3), (ce) as in (b), break into cases based on the value mod 4: C(22, 3) + 
[C(23, 2) X C(22, 1)] + [2 X C(23, 2) X C(45, 1)] + [C(23, 1) X C22, 2)] 
+ 22? x 23, 59. (a) [(2 x 2°) + (3 X 271/28, (b) {[(2 x 2°) + (3 X 
2] + (22 + (2 X 2)) + (2 X 2) + 17 +24 12% 
61. C(8, 3), C(8, 3) — (8 X 5). 
63. See Supplement of Selected Solutions at end of Chapter 5. 
65. (a) C(10, 6), 
(b) C(10, 6) + [5 X C(10, 5)] + [4 x C10, 4)] + C(O, 3). 
67. See Supplement of Selected Solutions at end of Chapter 5. 


Section 5.3 


1. 6!/112!3!. 3. (a) 3°, (b) (6!/2!°)/3° 5. C(10 + 4 — 1, 10). 

7. C(9 + 3 — 1,9), subtract cases where one party has a majority (8 or 

more) C9 +3 —- 1,9) -[3 X CG +3 - 1,3). 

9. (a) C110 + 3 — 1,10), (b) C5 4+ 3-1, 15)- C4+3— 1,4). 
LL 6/3121! + 6f/2!2Nt1!. 

13. C(10, 6) X {[C(6, 2) X 8!/2!2!1!4] + [6 x 8!/3!1 Py} 

15. Consider the cases of: (i) 4 of one letter and | of another letter, (ii) 3 of 
one kind and 2 of another letter, (iii) 3 of one letter and 1 of two others, (iv) 
2 of two letters and 1 of another, and (iv) 2 of one letter and 1 of three 
others—(2 X 3 X 51/411!) + (2 XK 2 X 51/3!2!) + [2 X CG, 2) X S!/ 
B!1!L!} + [CG, 2) x 2 x 5!/2!2!14] + GB X 51/2118). 

17. See Supplement of Selected Solutions at end of Chapter 5. 
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19. Counts all (1, 2, 3)-sequences of length 10 two ways: left side looks at 
all cases of k, 1s, k, 2s, and k; 3s, while right side counts all (unrestricted) 
10-digit sequences of 1s, 2s, 3s. 

21. See Supplement of Selected Solutions at end of Chapter 5. 

23. See Supplement of Selected Solutions at end of Chapter 5. 

25. Must have b or d after each c (except possibly last c); sum is over 
number of bs following cs (first sum when last c followed by b or d, second 
sum when last c is at end of sequence: > C(3, kK)21!/7!131(8 — /I[6 -— B - 
A]! + J CQ, k)201/7!21(8 — KI6 — (2 - b)! 


Section 5.4 


1. (a) C440 + 4- 1,40), () 1, (©) CG64+4-1, 36). 

3. (a) C(13, 4) X C13, 3y°/C(S2, 13), (b) [13!/5!5!2!11! x 391/ 
SISt11!12!]/521/13!4, (ce) 4 X C(48, 9)/C(52, 13), 

(d) 4! X 13!/4!3!°/521/13!4, 

5. C10 + 4-1, 10) x C(44+4- 1,4) x C644 - 1,6). 

6.3 X C(24, 16) x 2% 7. CUS -— 2) + 4-1,  — 2)] X Si. 
9. C[(14 — 5) + (7 — 1), (14 — 5)}/C20, 14). 11. C15 + 3 — 1, 15). 
13. (a) [52!/13!4]/4!, (b) [52!/8!97!4]/3! x 4!. 

BB? (a) 4 xX CO +4- 1,9), (b) PO, 3) x CO +4- 1,9), 

(c) distribute teddies and fill out each child with lollipops: 4°. 

17. C(7 -— 3) + 5-1, (7 — 3)] X 7N4l2i1t. 

19. See Supplement of Selected Solutions at end of Chapter 5. 

21. [10!/21)/5". 23. (a) distributions of 8 distinct items into 3 boxes, 
(b) distributions of 9 distinct items into 3 boxes with two items in the first 
box, etc. 

25. (a) distributions of 6 identical objects into 31 boxes, 33!,x; = 6, 

(b) distributions of 5 identical objects into 3 boxes with at most 4 objects in 
first box, etc., 22.4.x, = 5, x, = 5,4 = 4, x3 S 2. 

27. C(30 + 3 — 1, 30), 3 x C[(30 — 16) + 3 — 1, 30 — 16))]. 

29. (a) C77 + 4 — 1,7), 

(b) C7 +5 -—1,7) +1 (©) [(CU3 + 4-1, 13] - (4X CB +4 - 1). 
31. DoC(k + 2 -— 1,4) X C[(12 — 2k) + 2 — 1, (12 — 2k]. 

33. Li-o0C(7 + 3 — 1,7) X C[20 —- bh + 4 - 1, 20 - b). 

35. (a) 5 X 6 X 8, (b) (5 X 6 X 8) — 2. 

37. (a) 5! x C9, 5), (b) 21! X 5! X C(22, 5). 

39. di20C13, A) X C39, 13 — k) X C(26 + k, kK) X C(26, 13). 

41. > 15!/a!b!c! summing over all a, b, c 3-tuples where a + b + c = 15 
with no letter greater than 7. 

43. (3! X 25!/16!8!1!) + (3 X 25!/14!7!4! x 24) + 3 X [251/12!6!7! x 
(27 — 2) — 25!/6!6!1!]} + 25!/10!5P. 

45. See Supplement of Selected Solutions at end of Chapter 5. 

47. See Supplement of Selected Solutions at end of Chapter 5. 

49. C[(n — 2m) + (2m + 2) — 1, (n — 2m)} {simplifies to C(n + 1, 2m + 1). 
51. yo CkK+m-1AXChr-Hh+n-m—-1,¢—- bl. 
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Section 5.5 


8. n = 8; 

11. (b) [C~ + 1, 2)] + [6 X C(n + 1, 3)] + [6 X Cin + 1, 4]. 
13. (a) 2" + (nX 2", (b) 15 X 2" 

21. $ X CQn + 2,n + 1) — C(2n, n). 23. 0. 


Section 5.6 


1. 1234, 1243, 1324, 1342, etc., 3. 1234, 1235, 1236, 1245, 1246, 1256, 
1345, 1346, 1356, 1456, 2345, 2346, 2356, 2456, 3456. 
5. (a) 123456789(10), (10)987654321, (b) abcd, wxyz. 


CHAPTER SIX SOLUTIONS 
Section 6.1 


1. (a) 8 products—xxxx, x? Lex, x°x 1x, x2xx1, Lexx, 2x? Lx, 202x1, xx v1, 
(b) 15 products—1 12", face? Vx, TR 1 ee LL, L, Dice all 
xxl, x2xx, xx2x, xxx, x17, (0) 10 products—x411, 1x‘11, Pell, x?1x?1, 
Pine: x11", 1x?x71, 1x?xx, Lx'L’, Lix’x (d) 15 products—x‘11, x*x1, 
1K, Ky eee, 71K, P12 KE, ae Ix*l, 17x, Lx*x*,. Lex? 1194. 
3. (a) 0d t+x4+ x + +x txt + x3 + xt, 

(b) (K+ x24 x84 x8 + PV 4x7 4+ PDK H x Fee Hx? + x9), 
() dtxtxr?+-°) @MatKPH+ RO + -°- PU Fax txr teh 
5. (tx tx? 4+ -°-P(l + x), coef. of x°. 

7.0 + x4 +25 + ---)', coefficient of x'®. 9 (lteter tees) 
t+ er txtte dat Pt ete) txt? tee yr? 
13. (+x? + x3 + xt + x9 + x)", 

15. G+ x72 txt tLe xtxret x3 

17. 1 +x + x + eh 1. it xtxerte Jat xr+xet 
-- SC +x +x? + ++ -), coefficient of x, generally x” 

21. Cannot have a variable number of ee 

23. Lt x tx? tee) t+ xh + x! + ee yd + xl + x7 4 ++), 

2. (1txtxr+--- Pl tyty + yy 27. (xy + xz + yz)’. 
29. C(p, n), n!. 


Section 6.2 


1.C( +n— 1,8). 3. C(m, 9) + C(m, 7) + Ci, 5). 

5. C10 +4- 1,10) -4xC4+4—-1,4). 

7. (a) C110 + 10 — 1, 10), 

(b) C0 +4 - 1,10) -3X C01 +4-1,11, (© 0, 

(a) C(1L + 2-1, 11) +3 x C(I2 +2 ~ 1,12), (&) 6 x C(12, m). 
9.0. 1. (a) 0, (b) 1, () C12 + 8-1, 12), 

(d) 42x C2 +5-1,12), @ C4+4-1,4). 
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13. (a) @? + x38 + x4 4+---)3, C10 — 6) + 3 — 1, 10 — 6), 

(b) I txt x1 +x4+42x24+---),C00 4+ 3 —- 1, 10) - 

C7 +3-1,7), (©) (A+2x24+x44+--)0¢ x4 22 4-5-9, 

> cdo — 2k)+2-1,00-h). 15. C24+5-1,12)-5*x 
C7 +5 - 1,7) + C6, 2) x C2 + 5-1, 2). 

17. (a) CUS +n — 1,15) + C110 +n— 1,10), (b) Cm, 15) + Cin, 10). 
19. C8 +7-1,8)-7XCB+7-1,3). 

21. C6 +3-1,6)-CB+3-1,3)—-C2+3-1,2). 

23. C(14 + 10 — 1, 14) - 4 x C10 + 10 — 1, 10) +6 X C77 +: 10 - 
1,7) + C(4, 2) X C6 + 10- 1,6)+ 4X6xXCB+10-1,3)+4~x 
C72 + 10— 1,2) + C(@,2). 25. {C10 + 4-1, 10) - [4x ci4t+ 
4—-1,4] x c”5+4-1,15)-4x C9 +4- 1,9} + CO, 2) x 
C3+4-1,3)). 29 (a) Cmt+nmtn), 

(b) 0, if r odd, C(m, r/2), r even, (ce) 3”. 31. (b) 2. 


35. (b) m/2, pn, 10, mip. 37. Px(t) = (3)( ey 


2 1 — (t/2) 
Section 6.3 
1. (a) 5 partitions—4,3 + 1,2+14+1,2+2,14+1+1+4+1 
(b) 11 partitions—6,54+ 1,44+2,44+14+1,34+3,3+24+1,3 + 
1+14+1,24+24+2,24+24+14+1,2+14+1+4+1+4+1,1+1+ 
1+14+141. 
3 (Ll +x tx? + QL + x? + xt + x1 + 0? + x8 + LL, 
5. (1 — x) — x) = x) — x7. (b) (I + + x) 


1- 2 — 4 1— x° _ 8 
d+ x)...= FEZ)\LES)EX)(1ES) ... = cater 


1 
(dQ —xnd — x) — x5)... 


generating function (given just before Example 1) times 


canceling) 19. (a) Multiply the partition 


’ 


1 = 

(b) (I — xy I'd — x) {dd — ey! - d — x44, (©) same as (b). 

21. (a) Let the number of dots forming the first row and first column 

(= 2k — | if first row and column have length &) in a self-conjugate Ferrers 
diagram be the length of the first row in the distinct, odd-parts diagram; 
delete the first row and colunn of the self-conjugate diagram and use the 
number of dots in the reduced self-conjugate diagram to define the second 
row of the distinct, odd-parts diagram, etc. 


U 


Section 6.4 


aaa : 5 2lx 
L(iteth +h 4e 42). 3. (1 + xye?”. 
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3 4\ 13 
5. (1 +x as — i” i +— is ‘) , coefficient of x!7/13!. 7. (a) 1/23" + 1), 
(b) 1/4[3" + i (-1)1. 9. (a) 22!° + 4 X P(10, 1) X 22°+ 6X 
P(10, 2) X 228 + 4 x P(10, 3) X 22’ + P(10, 4) X 22°, 
(b) 26 — 4 x 25 + 6 X 24! — 4 x 23” + 22". 11. 34’. 
13. (a) eX — 2 X e@ DH + en” (hb) [n’ — An — 1) + (n — 2)Y/n’. 
15. (a) e*/x, (b) (1 -— x)! 21. eX, 


Section 6.5 


1. (a) x/1 — x¥, (b) 13/2 — 9, (©) 3x01 + 0/1 — 0), 
(d) [3x/ — x] + (7/1 — x], (©) 4tx4/1 — 2). 

3. x3 — x»/01 — x). 5. (a) (4x? — 3x + 1)/01 — x), 
(b) log.(1 — x). 


CHAPTER SEVEN SOLUTIONS 


Section 7.1 

1. a, = 5a,-1, I = 1 ora, = 5. 3. Gy = Gy-) + 2Gn-2. 

5. Gn = 3dy-) + 2Gq-5 + An-10 + An-25- 7. Gn = An) + Gy-2- 
9. a, = Qn-1 + An-2 11. a, = Qi) + Gy-2. 


13. (a) a, = a,-; + n,n > k, initial condition: a, = k + 1, 

(b) 43. 15. a, = 1.06(a,-; + 50). 17. Gy = 2a,-; — Qr-3 + 2" (the 

2nd and 3rd terms count sequences that start with 001 followed by (n — 3)- 

digit sequences with no consecutive Os). 19. a, = 2a,-; + 4". 

21. a, = 3ay-1 — GQn-3- 23. (a) a, = 2ay-1, Ay = Man-1, (b) a, = 

An-| + 2", Ay = An) + mM". 25. (a) P(k, 4) + P(k, 3), - (b) P(k, 4) + 

P(k, 3) + P(k, 2), (c) P(k, 5) + 3P(k, 4) + 2P(k, 3). 

27. Ang = Ang-1 F An-1k—-1 FH Gn-24h-1 TF Gn-3.-1- 

29. Gk = An-1k-1 + Ane - 

31. a, = Dy + Cnty Dp = Cy = 27! — Dae ~ Cy-1- 

33. a, = such sequences starting with a 0, b, = such sequences starting 

with a 1, c, = n-digit binary sequences with no consecutive 0s, a, = Dy-1 + 

Cn-3s Dn = An-1 + Danis Cn = Cn-1 TF Cn-2- 

35. p, = ways to hand out a penny, nickel, or dime on successive days with 

a penny on the first day, n, and d, are defined similarly, p, = M—-, + dn-1, 

Mn = Pn-s + Ay~52 dn = Pn-10 + Mn-10- STs BG, = 5Cy 

39. (a) a, = 2a,-, (b) a, = 2"', (ce) the first k integers in the sequence 

will form a set of consecutive integers, the (k + 1)-st integer can be the next 

larger or the next smaller number to extend this consecutive set. 

41. (a) fn, Hb =fam-1,)+fam—2,k - 1). 43. a, = a,-;,n nota 

multiple of 5, a, = @,-; + 2, n a multiple of 5 but not 10, a, = a,-; + 3, 
n+1 

rae 


na multiple of 10. “45. a, = Gy-3, Nn EVEN, A, = A,-3 + | 
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Section 7.2 
1. (a) An — 5, (b) An’? + 2n, (c€) A + 4n — [log,n]|, (d) An — 2. 


3. (a) a, = anno + 1, a, a logion, (b) a, = 104,10 ly 1, ay = » n— 3. 
5. a, = 24,2 + n— 1, a, = nlogn—n-t 1. 
7. (a). Pick largest from first half and largest from second half and 


compare, (b) a, = 2a. + 1, (ec) a,=n-— 1. 


Section 7.3 
1. (1.08)" x 1000. 3. (a) a, = 24" + 3(-1), (b) a, = 1, (¢) a, = 2, 


(d) a, = 42 — in + 1. vi 2-3 
5. a, = 2ay-) + Qy-2, Qn = eave —— (1+ V2)" + 


7. Pn — Pri = 2(Pn-1 = Pn-2)s ae (3 x 2") = Ds 
11. c, = 9, ce. =,—-18. 


(dd — V2y". 


Section 7.4 


1. (a) a, = 3C(n, 2) + 1, (b) a, = 2C(n + 1,3) + 3, (Ca, = 

6C(n + 2, 3) + 3C(n + 1, 2). 3. (a) a, = a,-; + n, a, = C(n + 1, 2), 
(b) a, = a,-; + 2C(n, 2) + n, a, = 2C(n + 1, 3) + C(n + 1, 2). 

5. 4a, =a,, + Bik x(n-2-hb + lha,= Cn, 4) 

+ C(n, 2) —n+ 1. 7. a, = 1250(1.08)" — 1250. 9, (a) —3" + 

1, (b) 32"+3-D", (© (3X 2")—n-2, (d) (15 X 2") — 2n? — 8n — 12. 
11. 3 X 2" — 3n — 2. 


13. a, = 2a,-; — a,-2 + (10 X 24, a, = 960n — 20 + (40 X 2"). 
WAX24+BX2" +34 3 19, (a) a, = V2"! - 1, 
(b) a, = n!, n even, a, = 0, n odd. 


Section 7.5 


1. (@) gx) — 1 = age) +, 


(b)_ g(a) — x — 1 = Baga) — 3 — 27909 +, 


2 2 
(c) g(x) — 1 = xg(x) + i= (d) g@) — 1 = 280) +7 
3. (a) g(x) — 1 = xg(xy, (b) g(x) — 1 — x — x? = [g(x) — 1 — xP, 
() gx) -1—-x= =a Le(x) - 1. 
7. Ay = GxQ,-| + 30,2 °° * + a,-1%,n= 1a, = 1, a, = 


— C(2n — 4,n — 2). 9. Gy = QQn,-) + Qa,-2 + °° + G1, 
i= 


n= 2,a, = 1,a, = *CQn —2,n— 1). 11. a, = P44 + GAn-igs 
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F(x) = (q + px)". 13. similar to recurrence relations in Example 5 except 
with 4”"! replacing 3"! and a, = 0 instead of a, = 1 (still b} = c, = 1), 

a, = &(4" — 1), n even, = i(4" — 4), n odd. 15. (a) a, = Cin - 1, 
k— l)a,-1, (b) g(x) =e. 


CHAPTER EIGHT SOLUTIONS 


Section 8.1 


1.2xX5 X 26+ — 5S? x 26°. 3.37 -3 x 20 

5. {C(52, 5) — C(13, 5) X C(4, 1~C(52, 5). 

7.4x (9+9 xX 8 X CQ, 2). 9. 600 — 200 — 200 — 100. 

11. (a) 200 — 70 — 100 + 30. (b) 200 — 100 — 60. 

13. (a) 200 — 3 X 80 + 3 X 30-15, (b) 80-2 X 30 + 15. 

15. 30-15-10-6+5+3+2-1. 17, 3° —-3 x 2%°+3%X 1. 
19, 6!/28 — 3 x 51/2!" + 3 xX 41/2! — 3!y 

21. 3 X C(6, 2) X 4! — {2 x C(6, 3) X 3! + 6!/2!7} + C(6, 4) X 2!. 
23. (15!/3! — 3 x 5! x 10!/2! + 3 Xx 58 — 5P)/(151/3!). 25. 20. 
27. N(Y) = NY — K) + NY NK) = 50 + 20. 

29. N(Salad) = 70, N (sandwiches) = 45. 


Section 8.2 


1. 10°.- 3 X 9° +3 X 8B" -— 7" 3. 13 X C(48, 6) — C(13, 2) X C(44, 2). 
5. (a) {C(52, 13) — C(4, 1) X C9, 13) + C(4, 2) x C(26, 13) — C(4, 3) 
x C(13, 13)/C(52, 13), (b) {C(4, 1) X C9, 13) — C4, 2) x C(26, 13) 
+ C(4, 3) X C(13, 13)}/C(52, 13), (©) {C(52, 13) — C4, 1) x C(48, 13) 
+ C(4, 2) X C(44, 13) — C(4, 3) X C(40, 13) + C(36, 13)}/C(52, 13). 

7, 91/3 — 3 x 71/3!? + 3 X 5!/3! — 3}. 

9. 26! — {3 X 23! + 24!} + {2 x 20! + 2 X 21!} — 18}. 

11. C25 + 6 — 1,25) - 3x C8 +6-1,18)+3xCl+6-1, 
11)- C4+6- 1,4). 13. C(i2 + 4 — 1, 12) — C4, 1) X C0 + 
3 — 1,10) + C(4, 2) x 9 — C4, 3) X 1. 15. 27. 17. ~ 10!/e. 
19. 10!/2!5 x {101/215 — C(5, 1) X 8!/2!4 + C(5, 2) x 61/2 — C(S, 3) X 
41/2? + C(5, 4) — C(5, 5)}. 

21. 8 — C(6, 1) X 8° + C(6, 2) x 8 — {(C(, 3) — 2) X 8 + 2 X 8) + 
{C(6, 4) — C(6, 5) + C(6, 6)} X 8. 23. (a) n° — C(5, 1) X nt + CG, 
2) x n? — C(5, 3) X w+ {C(5, 4) — C(5, 5)} Xn; (b) *® — CIS, 1) X 
n+ C(5, 2) X n — {C(5, 3) — 1) XW? + We} + (C6, 4 -— 2) X nt 
2X wt—n. (© n> — C(7,1) X nt + CCT, 2) X nw — {C(7, 3) — 3) X 
n+3 x n}+ {(C(7, 4) - 14) xX n+ 14 x Ws - {(C77, 5) -— 2) Xn + 
2 x nt + {C(7, 6) — C7, 7)} X 7. 25. D,. 27. if 21 = C2 + 

6 — 1, 2), then > (—1)' X C(6, &) X (21 — by”. 

29, 51" xX {(2n —“I)! — & C(5, k) X Qn -— 1 — BI}. 

31. {& (-1¥ X Pn, ky X (n — Kal. 
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35, Srh(-1)**! & Clk 3) X C(n, Ey X (n — BY, DE -1)**! x 

C(k — 1, 2) X C(n, k) X (n — ky. 

37. [C(5, 2) X 91/28] — [2 x C(5, 3) X 81/21] + [3 x CO, 4) X 71/2!) — 
[4x 6!]. 45. Stok xX {St (-1* x CY, bX nly}. 


Section 8.3 


1. 5 X 5 board with darkened squares on main diagonal. 

3. 7! — 9 X 6!) + (G0 X 5!) — (46 X 4!) + (32 X 3!) — (8 X 21). 

5. 5! — (7 X 4!) + (16 X 3!) - (13 X 2!) + (2 X 1D). 7. 3. 

9. (a) 4 X 5 board with darkened squares in 4 positions just to right of 
main diagonal, (b) (x + 1)4, (©) 324-1) & CU, bk) X C(n — 1, j) X 
(n — j)l. 11. 2 X 2 array of darkened squares and ‘‘L’’ (a column of 3 
squares beside a single square both have 1 + 4x + 2x’), 


CHAPTER NINE SOLUTIONS 
Section 9.1 


1. (a) not symmetric, (b) yes, (c) not transitive, (d) not transitive, 
(e) not transitive. 3. (a) 6 symmetries (as in Example 3), 
(b) 4 symmetries, (c) 1 symmetry. 5. (a) all C; left fixed, 
OC 

C1 Cy C3 Cy Cs Co C7 Cg Co Cro Cu Ci. C3 Cg Cis Cis 


ONS earthy seca ae etd 
C, C3 Cy C; C, C; Cs Cy Cs Ci Cio Cy Cu Cis Cy Cre 


Wee eee 

C C, Cs Cy C; Cy Cy Ci Cs Cio Ci Cis C3 Cy Cis Cie 

11. (a) a, b, c are rotations of 0°, 120°, and 240°, respectively d, e, f are 
flips around vertical axis, axis 30° clockwise of vertical, and axis 30° 
counterclockwise of vertical; row is first symmetry, column second 
symmetry: 


abe 


d 
d 
f 
e 
a 
c 
b 


QF Awl 


“SH & QO SS 


(b) straightforward, 


ie (EOF eg ol 1294) eS 1284)\ ge ee 
(co) leta = \ i434) 2143)’ 3412)’ 4321)” 
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Aa 78 


13. only left structure (right structure has 6 isomers). 
21. (b) {,, 73, 7s. M$ OF {7), 73, 77, Te}, (€) in addition to subgroups in (b) 
and G, G’, G”, other subgroups are {7,7;}, for i = 3, 5, 6, 8. 


Section 9.2 


1. (a) 11, (b) 24. 3. 435 + (2 X 3°). 5. 51. 

7, 3(5" + 5”) n even, and 3[5" + (3 X 5~)] n even. 

9. (a) in cycle form: 7, = (1)(2)(3), 7 = (12)(3), 73 = (13)(2), m4 = (1)(23), 
m5 = (123), m = (132), (b) (a) = C12 + 3 — 1, 12) = 91, Wm) = (7s) 
= h(a) = 7, Was) = (a) = 1, answer: {91 + 7+ 74+ 7+14+ 1= 
19. 11. (7) = 18, W(73) = 6, Wa) = (mg) = 12, and other (7;)’ = 
0, answer #(18 + 6 + 12 + 12) = 6; 13. (b) {7,, 73 7%, Ts}, (ce) {m, 
Tet. 


Section 9.3 


1. 55. 3. (a) 130, (b) 92, 

(c) cyclic color sequence on hexagon of R-W-B-R-W-W and R-B-W-R-W-W. 
5. (a) 40m‘ + 2m? + 3m), (b) am? + 2m? + m> + 4m), 

(c) 3m? + m°). 7. (a) gm® + 2m? + 3m‘), 

(b) am” + 2m? + 3m® + 2m’), (c) xm° + m’‘). 

9, 7 + BX 7] = 637. 11. (a) 3(2° + 2*) = 136, 

(b) 4728 + 24+ 0+ 2‘) = 72. 13. (7;) = number of cycles of length 1. 
15. (a) alm? + (p — 1) X m), 


(b) L{m? + [(p — 1) X m] + (p X mPP?)}. 


2p 
Section 9.4 


1. b+ btw + 2b3w? + 2b’w? + bwt + we. 3. bt + wit rt + Dw + 
br + bw3 + wr + br? + wr? + 2b’w? + 2b’? + 2w?r? + 2b’wr + 

2Qbw2r + 2bwr?. 5. (a) Hib + wy + 206 + wb + w) + 3°? + Ww’) 

(b+ wy} (b) Hib + w) + 2b + wy + (+ wb + wyh, 

(c) H(b + w)? + 2(bt + wt + 3(b? + w/e + 2B? + w2)(b + wy fe 

7. (a) H(b + wy) + 2b? + w?P + 3(b? + wb + wy, 

(b) H(b + w)’ + 20h + w7P(b + w) + (Bb? + wb + wy}. 

9. (a) &{(b + wy) + 8(b + w\(b + w) + 30 + wh, (b) A{(D + w)® 
+ 6(b4 + w4\(b + wy? + 3(B + wb + wy + 6b? + Ww? + 82 + 
w>}}. 11. A{(b + wy + 6(b* + BY) + 8(b? + w3\(b + W) + 3(B + WP 
+ 6(b? + w2)(b + wy}. 13. (a) if not a cyclic rotation of all corners, 
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the length of the cycle would have to divide p—impossible, (b) C(p, k)/ 
p. 15. (a) 36, (b) 216. 


CHAPTER TEN SOLUTIONS 
Section 10.1 


1. (G) 3. One labeled factor must be 
ey >) © 
1 
3A V4 3 
(B) Dae 


1: W-B, 2: G-R, 3: B-G, 4: R-W, second labeled factor can be 1 & 2: G-B, 
3: R-R, 4: W-W or 1: B-G, 2: W-B, 3: R-W, 4: G-R (or interchange 1 and 2). 
5. (a) yes, a Hamilton circuit is a factor, 

(b) having a Euler circuit has no relation to having a factor. 


Section 10.2 


1. (a) {a, c} or {b, d}, (b) f (c) no kernel, consider directed 5-circuit 
b, a, d, g, h, b, a—if ais K (kernel), then d not in K, then g in K, then h 
not in K, then b in K—impossible since a in K; similar sort of argument 
(also involving c, f, e) if a not in K. 

3. {3, 4, 9, 11, 12, 16, 17, 21, 25, 26, 27, 31, 32, 36, over 40}. 

5. A goes to 2, B must go to 4 or else A will win. 

7. Move to multiples of 5k + 1 (initial position is win for second player). 

9. By symmetry assume g(a) = 0, then g(e) = 1, then g(d) = 0, then 

g(c) = 1, then g(b) = 0, but now two kernel vertices are adjacent. 11. S is 
a kernel if and only if all vertices not in S have an edge to a vertex in S$ 
while no vertex in S has an edge to a vertex in S, that is, if and only if 
W(S) = S. 13. Follows from parts (a) and (b) of Exercise 12 since 

g(x) = k means there is a path of length k starting at x while /(x) is length 
of longest path starting at x; longest path is at least length k (maybe there 

is another longer path starting at x). 15. Suppose x and y are adjacent 
because there is an edge from x to y; then x must have a larger level number 
than y and a different Grundy value from its successor y. 17. If there 
were an infinite number of vertices, then one of the finite number of starting 
vertices, call it x;, must have an infinite number of vertices reachable from 
it, and one of the finite number of successors of x,, call it x , must have an 
infinite number of vertices reachable from it, and one of the finite number of 
successors Of x, call it x3, must have an infinite number of vertices 
reachable from it, and so on, without end. 19. Let a = 0000, b = 000, 
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c = 0.00, d = 00_0, e = 00, f = 0_0, g = 0, A = _ (win); s(a) = {b, c, d, 
e, f}, s(b) = fe, f, g}, s(c) = s(d) = fe, f, gh, s(e) = {g, A}. s(f) = {a}, 
s(g) = h, s(h} = @; g(f) = ah) = 0, g(a) = 8(g) = 1, gfe) = 2, g(b) = 
g(c) = g(d) = 3. 


Section 10.3 


1. (a) 7, remove 3 from 4"pile, (b) 0, 

(c) 4, remove 4 from 2™, 3%, or 4" pile, (d) 0. 

3. (a) 3, remove 3 from 3” pile or 2 from 4" pile, 

(b) 2, remove 2 from 3" or 4" pile, (c) 0, (d) 0. 

5. (a) 3, add nickel to 3" pile, (b) (0, 0), (0, 4), (0, 6), (0, 9), C, 1), 

(2, 2), (2, 5), (2, 8), (3, 3), 3, 7), (4, 4), (4, 6), (5, 5). 7. If c; = d;, then 
trivially c' +c, =c' +d; ifc’ +c) =’ + dj, then c; and d, must have 
1s in the same positions in their binary representations, that is, they must be 
equal. 9. Immediately the proposed strategy works. 11. (a) Remove 
3 balls along one of the 3 lines formed by balls on one of the 3 sides of the 
arrangement. 


APPENDICES SOLUTIONS 


Section A.l 


1. (a) 12,27 (b) 2, 3, 6, 7, 9, 12, 15, 17, 18, 21, 22, 24, 27, 

(c) 1, 4, 5, 8, 10, 11, 13, 14, 16, 19, 20, 23, 25, 26, 28, 29, 

(d) all 1 Sk S 29 except 12, 27. 3. We are given N(R M M) = 2 
as well as that N(M) = N(R) = N(M) = N(R) = 4; then N(R NM) = 
N(M) —- NRO M)=4-2=2,NROM) =NR) - NRNM) = 
4 — 2 = 2, and clearly N(R MN M) = 2. 5. (a) impossible, 

(b) yes, 20-8 —8 =4, (c) 20-15 =S. 


N 


(c) (AUB) NM (ANB) = (ANB), see Figure Al.3, (d) A — (B — A) =A; 
9. here all expression involve M, so that ABC = 


Ln ANBNC. 
9 
& 


ABC 
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17. (a) E=SUHUC, 52 — 39%, 

(b) E =(S UHUC)N GNANC), 525 — 393 — 133, 

(c) E=(SNHNC)UGSNANC)U CNANO),3 X 13? X 39. 
19. (a) (p,q, r) = (TT, T), (7, T, F), (F, T, F), (T, F, F), (F, F, F), 
(b) (p, 4, r) = (F, F,T), (7, T, F), (F, T, F), (T, F, F), (F, F, F), 

(c) (p,q,r) =(F,F,F). 21. 2 


Section A.2 


23. One can prove by induction only a property that is a function of n; for 
example, one can prove that there are a finite number of binary sequences of 
length = n. 25. The initial step only assumes that n = 1, not n = 2, but 
for n = 1, a"? is undefined. 


Section A.3 


1. 1/2. 3. (a) 1/6, (b) 18/36 = 1/2, (ce) 3/36 = 1/12. 5. (a) 1/6, 
(b) 1/2. 7. (a) 1/3 X 1/3 = 1/9, (b) 2 X 1/3 X 2/3 = 4/9. 

9, (2X 2 X 2)/4! = 1/3. 11. 1 — [(50 + 40 — 20)/100] = 3/10. 

13. 2/3. 

15. (a) all sequences with k tails, 0 < k = 8, and one head followed by a 
head, (b) all positive integers, (c) all ordered pairs of positive integers, 
(d) all sequences of k black balls, k = 0, followed by a red ball. 


Section A.4 


lntl. 
13. Printer i is connected to computers j,i + 1,i+ 2,i+3,i+4,i+ 5. 


INDEX 


A 
Addition Principle, 172 
Adjacency matrix, 49, 110 
Adjacent vertices, 3 
Adler, I., 315 
Aho, A., 49, 132 
Ahuja, R., 166, 167 
Algorithmic proof procedure, 413 
Ancestor in a tree, 94, 426 
Anderson, I., 429 
Appel, K., 36, 76, 88, 92 
Arrangement(s), 180, 423 

with repetition, 196 

Art Gallery Problem, 87 
Ascent in permutation, 346 
Assignment problem, 126 
Augmenting flow algorithm, 144 
Augmenting matching algorithm, 

157 
Automorphism of graph, 54 


B 
Backtracking in graph, 103, 426 
Balanced tree, 96, 426 
Ball, W., 403 
Barnette, D., 92 
Behzad, M., 429 
Berge, C., 277, 399, 403, 429, 430 
Berlekamp, E., 403 
Berman, G., 429 
Beroulli, Jacob, 231 
Bernoulli, Jacques, 231, 417 
Biggs, N., 48, 92, 429 
Binomial coefficient, 181, 213, 423 
generalized, 310 
Binomial identities, 213 
Binomial Theorem, 214 


Bipartite graph, 4, 31, 75, 156, 423 
deficiency, 165 

Birthday Paradox, 192 

Bishops, dominating, 16 

Blockwalking, 216 

Boat crossing puzzle, 109 

Bogart, K., 429 

Bondy, J., 48, 92, 429 

Boole, G., 408 

Boolean algebra, 406 

Bouton, C., 403 

Boyer, C., 409 

Branch-and-bound search, 118 

Breadth-first search, 104, 426 

Bridge in a graph, 52 

Bridge probabilities, 203 

Brook’s Theorem, 88 

Brualdi, R., 427 

Bubble sort, 127 

Buck, R., 188, 231 

Burnside’s Lemma, 361 

Bussey, W., 413 


C 

C(n,r), 180 
Capacity: 

of cut, 139 

of edge, 138 
Capobianco, M., 429 
Cardano, B., 417 
Catalan number, 311 
Cayley, A., 48, 99, 132 
Center of a tree, 101 
Chain in network, 143 
Characteristic equation, 297, 308 
Characteristic sequence of a tree, 109 


Chessboard, generalized, 339 
457 | 
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Chromatic number, 77, 423 
Chromatic polynomial, 83, 91, 186, 
284 
Circle-chord method, 37 
Circuit, in graph, 4, 423 
Clique, 54 
Closure in group, 355 
Coalition voting, 188 
Cohen, D., 419, 429 
Cohen, J., 48 
Coin balancing, 98 
Color critical graph, 83, 90 
Coloring a graph, 77, 331, 423 
Combination, 180, 423 
algorithm for, 226 
Committee scheduling, 8, 79 
Complement: 
of chessboard, 347 
of graph, 23, 83, 90, 424 
of set, 406 
Complete graph, 21, 29, 423 
Component of graph, 46, 424 
Configuration in graph, 39 
Conjugate of Ferrers graph, 263 
Conjunction of propositions, 407 
Connected graph, 41, 424 
algorithm for connectedness, 110 
Conway, J., 403 
Cormen, T., 295, 315 
Coxeter, H., 403 
Crossing number, 45 
Cut in network, 138 
a—z cut, 138 
Cut-set, 53, 154 
Cycle in a graph, 53, 57, 424 
Cycle index, 368 
Cycle in permutation, 354 
Cycle structure representation, 367 


D 
David, F., 231, 418 
Deadheading edge, 59 
DeCarteblanche, F., 403 
Deficiency of bipartite graph, 165 


Degree of a region, 42 

Degree of a vertex, 20 
in-degree and out-degree, 23 

De Moivre, A., 276, 315, 348 

De Morgan, A., 408, 413 

Deo, N., 231, 232 

Depth-first search, 103, 426 

Derangement, 330, 424 

Descendant in a tree, 94 

Dictionary search, 6, 97 

Difference equation, 287, 315 

Difference of sets, 406 

Digital sum, 398 

Dijkstra, O., 133 

Direct sum of graph, 397 

Dirichlet drawer principle, 420 

Disjunction of propositions, 408 

Distribution, 202, 424 

Divide-and-conquer relations, 293 

DNA model, 9, 198 

Dominating set, 54 

Dual graph, 36 

Durfee square, 265 


E 

Edge cover, 7, 16, 158, 424 
Edge in graph, 3 

directed edge, 3 
Edge chromatic number, 88 
Elements of set, 405 
Equitable coloring, 83 
Equivalence class, 352 
Equivalence relation, 352 
Euler, L., 41, 48, 57, 277 
Euler cycle, 57, 72, 91, 424 
Euler trail, 61 
Euler’s constant e, 188, 268, 330 
Euler’s formula for graphs, 41, 236 
Event of outcomes, 416 

compound event, 416 

elementary event, 416 
Expected value of random variable, 

259 

Experiment, 416 


F 
Factor in graph, 383 
labeled factor, 383 
Feller, W., 276, 277 
Fermat, P., 231, 417 
Ferrers diagram, 262 
Fibonacci, 315 
Fibonacci numbers, 276, 281, 308, 
315 
Fibonacci relation, 281, 308, 299, 315 
Fleury’s algorithm, 64 
Flow chart, 14 
Flow in network, 138 
dynamic flow, 149 
value of flow, 139 
in a planar network, 155 
Floyd’s algorithm for shortest paths, 
135 
Ford, L., 166, 167 
Forest of trees, 101 
Four color problem, 36, 76, 88, 92 
Fourier transform, 276 
Fryer, K., 429 
Fulkerson, D., 166, 167 
Functional equation, 306 


G 
Galileo, 417 
Garbage collection, 81 
Generating functions, 243, 306, 
424 
exponential, 267 
ordinary, 243 
Generators of group, 358 


Gould, R., 430 
Graham, R., 419, 429 
Graph, 3 


directed graph, 3, 19, 387, 424 
Graver, J., 430 
Gray code, 71 
Group of symmetries, 356 
Grundy, P., 403 
Grundy function, 392, 398 
Guy, R., 403 
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H 
Haken, W., 36, 76, 88,92 ~ 
Hall, M., 430 


Hall’s Marriage Theorem, 159 

Halmos, P., 409 

Hamilton, W., 92 

Hamiltonian circuit and path, 65, 91, 
118, 424 

Harary, F., 49, 380, 430 

Heap, 129 

Heap sort, 129 

Height of tree, 96, 129 

Hillier, F., 167 

Hit polynomial, 336 

Hopcroft, J., 49, 132 

Hypercube, 72 


I 

Identical Objects Rule, 417 

Identity of group, 355 

Inclusion—exclusion formula, 325 

Independent set, 77, 425 

Induction, 411 

Initial conditions in recurrence relation, 
279 

Instant Insanity puzzle, 381 

Integer solution of equation, 205, 245, 
328 

Interest problems, 283 

Internal vertex, 94 

Intersection of sets, 406 

Interval graph, 9, 10, 27, 28, 80, 91, 
425 

unit-interval graph, 17 

Inverse in a group, 355 

Isolated vertices, 22 

Isomers of organic compounds, 358 

Isomorphism of graphs, 20, 425 


J 
Jackson, B., 429 


K 
K,, 2 
Kayles game, 395 
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Kernel of game, 390 

Kiefer, J., 315 

Kirchhoff, G., 48, 132 

Knight’s tour, 63 

Knuth, D., 413, 429 

Konig, D., 48 

Konigsberg bridges, 57 

Konig’s Infinity Lemma, 387 

Kruskal’s minimal spanning tree algo- 
rithm, 112 


L 
Laplace, S., 277, 415, 417 
Laplace transform, 276 
Lawler, E., 132 
Leaves of tree, 94, 426 
Lehmer, D., 231, 232 
Leibnitz, G., 231 
Length of path or circuit, 31 
Lesniak-Foster, L., 429 
Leiserson, C., 295, 315 
Lenstra, J., 132 
Level numbers in a progressively finite 

graph, 391 
Level numbers in tree, 94, 426 
Lexicographic order, 224 
Lieberman, G., 167 
Linear program, 166 
Line graph, 47, 54, 63, 76 
Linked list, 50 
Liu, C., 429 
Lloyd, E., 48, 92, 429 
Lovasz, L., 429 
Lucas, E., 132 


M 
MacMahon, P., 277 
Magnanti, T., 167 
Map coloring, 36, 425 
Matching in graph, 4, 156, 425 
maximal, 156 
X-matching, 156 
Matching network, 157 
Maurolycus, 413 
Max flow-min cut theorem, 147 


Maze searching, 104 
Member of set, 405 
Menage, 347 
Merge sort, 128 
Minimal spanning tree, 111 
Molluzzo, J., 429 
Moments of random variable, 259 
kth moment, 272 
Montmort, P., 348 
Mounting climbing, 29 
Multigraph, 57, 425 
Multiplication, fast, 295 
Multiplication Principle, 172 
Murty, U., 48, 92, 429 


N 
Negation of proposition, 408 
Network, 133, 425 
Network flow, 138 
Nievergelt, J., 231, 232 
Nijenhuis, A., 430 
Nim game, 396 
Nonconsecutive arrangements, 207 
Nonplanar graph, 46 


O 
Organic compounds, 358 
Ore, O., 430 
Orlin, J., 167 
O’Rourke, J., 87, 92 


P 

P(n,r), 180 
Palmer, E., 380 
Parent in a tree, 94, 427 
Parenthesization, 310 
Parsing, 18 
Partitions, 260, 425 

of integer, 260 

ordered, 265 

self-conjugate, 265 
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